Numeration systems on a regular language:
Arithmetic operations, Recognizability and Formal power series

Michel Rigo 

Institut de Mathématiques, Université de Liège,
Grande Traverse 12 (B 37), B-4000 Liège, Belgium.
M.Rigo@ulg.ac.be

January 17, 2000 
Abstract 

Generalizations of numeration systems in which $\mathbb{N}$ is recognizable
by a finite automaton are obtained by describing a lexicographically
ordered infinite regular language $L \subseteq \Sigma^*$. For these systems,
we obtain a characterization of recognizable sets of integers in terms of
rational formal series. We also show that, if the complexity of $L$ is
$\Theta(n^l)$ (resp. if $L$ is the complement of a polynomial language), then
multiplication by $\lambda \in \mathbb{N}$ preserves recognizability only if
$\lambda = \beta^{l+1}$ (resp. if $\lambda \neq (\#\Sigma)^\beta$) for some
$\beta \in \mathbb{N}$. Finally, we obtain sufficient conditions for
the notions of recognizability and $U$-recognizability to be equivalent,
where $U$ is some positional numeration system related to a sequence
of integers.

1 Introduction 

According to [?], a numeration system is a triple $S = (L, \Sigma, <)$ where $L$
is an infinite regular language over a totally ordered alphabet $(\Sigma, <)$.
The lexicographic ordering of $L$ gives a one-to-one correspondence $r_S$
between the set $\mathbb{N}$ of natural numbers and the language $L$. A subset
$X \subseteq \mathbb{N}$ is called $S$-recognizable if $r_S(X)$ is a regular
subset of $L$.

We first characterize the $S$-recognizable subsets of $\mathbb{N}$ in terms of
rational series in the noncommuting variables $\sigma \in \Sigma$ and with
coefficients in $\mathbb{N}$. In particular, we show that
$\sum_{n \in \mathbb{N}} n\, r_S(n)$ is rational (this kind of result is also
discussed in [?]). Using classical results about rational series, we obtain
a generalization of the fact given in [?] that ultimately periodic sets are
$S$-recognizable for any numeration system $S$.

Our main purpose is related to the stability of $S$-recognizability under
arithmetic operations like addition and multiplication by a constant. If
addition preserves $S$-recognizability then multiplication by 2 also
preserves $S$-recognizability. So a natural question about the stability of
recognizability arises: when does multiplication by an integer $\lambda$
preserve recognizability?

It is well known that for positional numeration systems in base $p$ the
problem of addition and multiplication by a constant is completely settled.
The $p$-recognizable sets are exactly those definable in the first-order
structure $\langle \mathbb{N}, +, V_p \rangle$ (see for instance [?, ?]). It is
obvious that addition and multiplication by a constant are definable in
Presburger arithmetic. Therefore, $p$-recognizability is preserved.

On the other hand, using the specific structure of the language $a^*b^*$,
it is shown in [?] that for the numeration system
$S = (a^*b^*, \{a,b\}, a < b)$, multiplication by a non-negative integer
$\lambda$ transforms the $S$-recognizable sets into $S$-recognizable sets if
and only if $\lambda$ is a perfect square. Hence multiplication by 2 does not
preserve $S$-recognizability.

Notice that the language $a^*b^*$ has polynomial complexity (the complexity
function $\rho_L(n)$ of a language $L$ counts the number of words of length
$n$ in $L$). So it is natural to check whether a numeration system on a
polynomial language preserves the recognizability of a set after
multiplication by a constant. For $a^*b^*$, perfect squares play a special
role. Does there exist a similar set for an arbitrary language with
complexity in $\Theta(n^l)$? We get the following result: if $S$ is a
numeration system built on a regular language with complexity in
$\Theta(n^l)$ then multiplication by $\lambda$ preserves recognizability only
if $\lambda = \beta^{l+1}$ for some integer $\beta$. As a consequence, addition
cannot be a regular map for numeration systems on polynomial regular
languages.

In order to prove this, we proceed in two steps. In Section 4 we assume
that the complexity of the language is a polynomial of degree $l$ with
rational coefficients. With such a language, we exhibit a subset $X$ which is
recognizable and we prove that $\lambda X$ is not recognizable for any
$\lambda \in \mathbb{N} \setminus \{n^{l+1} : n \in \mathbb{N}\}$.
In Section 5, we consider the general case.

In this study of polynomial regular languages, we have obtained an
interesting result about a special sequence associated with a language. We
denote by $v_L(n)$, or simply $v_n$ if the context is clear, the number of
words of length not exceeding $n$ belonging to $L$. In Section 5, we show that
if the complexity of $L$ is $\Theta(n^l)$, then the sequence
$(v_n / n^{l+1})_{n \in \mathbb{N}}$ converges to a strictly positive
limit. It is surprising to notice that, in contrast, the sequence
$(\rho_L(n) / n^l)_{n \in \mathbb{N}}$ generally does not converge.

The end of this paper is mainly related to exponential languages. In
Section 6, we consider numeration systems on the complement of a polynomial
language. As in the polynomial case, we find a recognizable set $X$ and
constants $\lambda$ such that $\lambda X$ is not recognizable. Here, the
$\lambda$'s are powers of the cardinality of the alphabet.

In the last section, we study relations between some positional numeration
system $U$ and a system $S$ on a regular language $L$. We give sufficient
conditions for the equivalence of $S$-recognizability and $U$-recognizability.
These conditions are strongly dependent on the language $L$ and the
recognizability of the normalization in $U$. Using these conditions, we give
two examples of numeration systems on an exponential language such that
addition and multiplication by a constant preserve $S$-recognizability.

2 Basic definitions and notations 

We denote by $\Sigma^*$ the free monoid (with identity $\varepsilon$) generated
by $\Sigma$. For a set $S$, $\#S$ is the cardinality of $S$ and for a string
$w \in \Sigma^*$, $|w|$ is the length of $w$.

Let $L \subseteq \Sigma^*$ be a regular language; the minimal automaton of $L$
is a 5-tuple $M_L = (K, s, F, \Sigma, \delta)$ where $K$ is the set of states,
$s$ is the initial state, $F$ is the set of final states and
$\delta : K \times \Sigma \to K$ is the transition function.
We often write $k.\sigma$ instead of $\delta(k, \sigma)$. Recall that the
elements of $K$ are the derivatives [?, III.5]

$$w^{-1}.L = \{ v \in \Sigma^* : wv \in L \}, \quad w \in \Sigma^*.$$

The state $k$ is equal to $w^{-1}.L$ if and only if $k = s.w$; $w^{-1}.L$ being
then the set $L_k$ of words accepted by $M_L$ from $k$. In particular, $L = L_s$.

We denote by $u_l(k)$ the number $\#(L_k \cap \Sigma^l)$ of words of length $l$
belonging to $L_k$ and by $v_l(k)$ the number of words of length at most $l$
belonging to $L_k$,

$$v_l(k) = \sum_{i=0}^{l} u_i(k).$$

Notice that the notations $L_k$, $u_l(k)$ and $v_l(k)$ make sense for any DFA
(deterministic finite automaton) accepting $L$.

The lexicographic ordering can be used to compare words of different
lengths. Let $x$ and $y$ be two words. We say that $x < y$ if $|x| < |y|$, or if
$|x| = |y|$ and there exist letters $\alpha < \beta$ such that $x = w \alpha x'$
and $y = w \beta y'$.

An extension of numeration systems in which the set of representations 
is regular is the following. 

Definition 1 A numeration system is a triple $S = (L, \Sigma, <)$ where $L$ is
an infinite regular language over a totally ordered finite alphabet
$(\Sigma, <)$ (see [?]). The lexicographic ordering of $L$ gives a one-to-one
correspondence $r_S$ between the set $\mathbb{N}$ of natural numbers and the
language $L$.

For each $n \in \mathbb{N}$, $r_S(n)$ is the $(n+1)$th word of $L$ with respect
to the lexicographic ordering and is called the $S$-representation of $n$.
For $w \in L$, we set $\mathrm{val}_S(w) = r_S^{-1}(w)$ and we call it the
numerical value of $w$.

The mappings $\mathrm{val}_S$ and $r_S$ are sometimes called ranking and
unranking in the literature.
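
As an illustration of the two maps, here is a minimal Python sketch for the system built on $a^*b^*$ with $a < b$ (the system used as a running example in this paper). The function names `rep` and `val` are our own choice, and the brute-force enumeration is only meant to make the definition concrete; it is not an efficient ranking algorithm.

```python
from itertools import count, islice, product

ALPHABET = "ab"  # ordered alphabet, a < b


def in_language(word: str) -> bool:
    """Membership test for a*b*: only letters a, b and no 'a' after a 'b'."""
    return set(word) <= set(ALPHABET) and "ba" not in word


def genealogical_words():
    """Yield the words of a*b* by increasing length, then lexicographically."""
    for length in count(0):
        for letters in product(ALPHABET, repeat=length):
            word = "".join(letters)
            if in_language(word):
                yield word


def rep(n: int) -> str:
    """S-representation of n: the (n+1)-th word of the language."""
    return next(islice(genealogical_words(), n, None))


def val(word: str) -> int:
    """Numerical value of a word of the language (inverse of rep)."""
    if not in_language(word):
        raise ValueError("word is not in a*b*")
    for n, candidate in enumerate(genealogical_words()):
        if candidate == word:
            return n


print([rep(n) for n in range(7)])  # ['', 'a', 'b', 'aa', 'ab', 'bb', 'aaa']
```

The word `ba` is skipped because it is not in the language, which is why `bb` receives the value 5.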

This way of representing integers generalizes linear numeration systems
in which $\mathbb{N}$ is recognizable by finite automata. Examples of such
systems are the numeration systems defined by a recurrence relation whose
characteristic polynomial is the minimal polynomial of a Pisot number (i.e.
an algebraic integer $\alpha > 1$ such that its Galois conjugates have modulus
less than one) [?]. (Indeed, with this hypothesis, the set of representations
of the integers is a regular language.) The standard numeration systems with
integer base and also the Fibonacci system belong to this class.

Definition 2 Let $S$ be a numeration system. A subset $X$ of $\mathbb{N}$ is
$S$-recognizable if $r_S(X)$ is recognizable by a finite automaton.

Let $S = (L, \Sigma, <)$ be a numeration system. Each $k \in K$ for which $L_k$
is infinite leads to the numeration system $S_k = (L_k, \Sigma, <)$. The maps
$r_{S_k}$ and $\mathrm{val}_{S_k}$ are simply denoted $r_k$ and
$\mathrm{val}_k$ if the context is clear. If $L_k$ is finite, the maps $r_k$
and $\mathrm{val}_k$ are defined as in the infinite case but the domain of the
former restricts to $\{0, \ldots, \#L_k - 1\}$.

With these notations, we can recall a very useful proposition. 

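Lemma 3 Let $S = (L, \Sigma, <)$ and $M = (K, s, F, \Sigma, \delta)$ be a DFA
accepting $L$. If $\sigma w$ belongs to $L_k$, $k \in K$, $\sigma \in \Sigma$,
$w \in \Sigma^+$, then

$$\mathrm{val}_k(\sigma w) = \mathrm{val}_{k.\sigma}(w) + v_{|w|}(k) - v_{|w|-1}(k.\sigma) + \sum_{\sigma' < \sigma} u_{|w|}(k.\sigma'). \qquad \Box$$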
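
The formula of Lemma 3 can be checked by brute force on a small example. The sketch below assumes the reading $\mathrm{val}_k(\sigma w) = \mathrm{val}_{k.\sigma}(w) + v_{|w|}(k) - v_{|w|-1}(k.\sigma) + \sum_{\sigma' < \sigma} u_{|w|}(k.\sigma')$ and uses a complete three-state DFA for $a^*b^*$; the state numbering and helper names are ours.

```python
from itertools import product

ALPHABET = "ab"  # a < b
# Complete DFA for a*b*: state 0 reads a's, state 1 reads b's, state 2 is a sink.
DELTA = {(0, "a"): 0, (0, "b"): 1, (1, "a"): 2, (1, "b"): 1,
         (2, "a"): 2, (2, "b"): 2}
FINAL = {0, 1}


def step(k, word):
    """State reached from k after reading word."""
    for letter in word:
        k = DELTA[(k, letter)]
    return k


def words_from(k, max_len):
    """Words of L_k of length <= max_len, in genealogical order."""
    return ["".join(t) for n in range(max_len + 1)
            for t in product(ALPHABET, repeat=n)
            if step(k, "".join(t)) in FINAL]


def u(l, k):
    """Number of words of length exactly l in L_k."""
    return sum(1 for w in words_from(k, l) if len(w) == l)


def v(l, k):
    """Number of words of length at most l in L_k."""
    return len(words_from(k, l))


def val(k, word):
    """Rank of word inside L_k (word must belong to L_k)."""
    return words_from(k, len(word)).index(word)


def lemma3_rhs(k, sigma, w):
    """Right-hand side of Lemma 3 for the word sigma.w read from state k."""
    ks = DELTA[(k, sigma)]
    smaller = sum(u(len(w), DELTA[(k, s)]) for s in ALPHABET if s < sigma)
    return val(ks, w) + v(len(w), k) - v(len(w) - 1, ks) + smaller


checked = 0
for k in (0, 1):
    for word in words_from(k, 6):
        if len(word) >= 2:  # sigma followed by a non-empty w
            assert val(k, word) == lemma3_rhs(k, word[0], word[1:])
            checked += 1
print(checked, "instances agree")
```

The recursion is what makes the value of a word computable letter by letter, which is the starting point of the formal-series argument in the next section.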
3 Recognizable formal power series

Let $R$ be a semiring; a formal power series $T : \Sigma^* \to R$ can be written
as a formal sum

$$T = \sum_{w \in \Sigma^*} (T, w)\, w.$$

We mainly adopt the terminology of [1] concerning semirings, rational and
recognizable series. Recall that to each word $u \in \Sigma^*$ and each formal
series $T$, one associates the series $u^{-1}T$ defined by

$$u^{-1}T = \sum_{w \in \Sigma^*} (T, uw)\, w.$$

In other words, $(u^{-1}T, w) = (T, uw)$.

It is shown in [?] that the series
$\sum_{w \in X^*} \pi_2(w)\, w \in \mathbb{N}\langle\langle X \rangle\rangle$ is
rational. In the last expression, $X$ is the alphabet $\{x_0, x_1\}$ and if
$w = x_{i_k} \cdots x_{i_0}$ then $\pi_2(w) = 2^k i_k + \cdots + 2 i_1 + i_0$ is
the numerical value in base two of $w$.

Here, we obtain the same result for any numeration system on a regular
language. Another proof of this result can be found in [?] where complexity
problems are discussed.

Proposition 4 Let $S = (L, \Sigma, <)$ be a numeration system. The formal
series

$$T_S = \sum_{w \in L} \mathrm{val}_S(w)\, w \in \mathbb{N}\langle\langle \Sigma \rangle\rangle$$

is recognizable.

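Proof. Let $M_L = (K, s, F, \Sigma, \delta)$ be the minimal automaton of $L$.
For $k, l \in K$, $\sigma \in \Sigma$, we introduce the following series of
$\mathbb{N}\langle\langle \Sigma \rangle\rangle$

$$T_k = \sum_{w \in L_k \setminus \{\varepsilon\}} [\mathrm{val}_k(w) - v_{|w|-1}(k)]\, w,$$
$$U_{l,k} = \sum_{w \in L_l \setminus \{\varepsilon\}} u_{|w|}(k)\, w, \qquad U'_{l,k} = \sum_{w \in L_l} u_{|w|}(k)\, w,$$
$$V_{l,k} = \sum_{w \in L_l \setminus \{\varepsilon\}} v_{|w|-1}(k)\, w,$$
$$W_{k,\sigma} = \begin{cases} [\mathrm{val}_k(\sigma) - v_0(k)]\, \varepsilon & \text{if } \sigma \in L_k \\ 0 & \text{otherwise.} \end{cases}$$

If $k, l \in K$, $\sigma, \sigma' \in \Sigma$, then we have the following relations

i) $\sigma^{-1} T_k = T_{k.\sigma} + \sum_{\sigma' < \sigma} U_{k.\sigma, k.\sigma'} + W_{k,\sigma}$
ii) $\sigma^{-1} U_{l,k} = \sum_{\sigma' \in \Sigma} U'_{l.\sigma, k.\sigma'}$
iii) $\sigma^{-1} U'_{l,k} = \sum_{\sigma' \in \Sigma} U'_{l.\sigma, k.\sigma'}$
iv) $\sigma^{-1} V_{l,k} = V_{l.\sigma, k} + U'_{l.\sigma, k}$
v) $\sigma^{-1} W_{k,\sigma'} = 0$.

To check relation i), one has to compute $(T_k, \sigma w)$. Notice that
$\sigma w \in L_k$ iff $w \in L_{k.\sigma}$. Use Lemma 3 and treat the case
$w = \varepsilon$ separately. For relations ii) and iii), if $\sigma w$ belongs
to $L_l$ then $w \in L_{l.\sigma}$ and

$$u_{|\sigma w|}(k) = \sum_{\sigma' \in \Sigma} u_{|w|}(k.\sigma').$$

In iv), one observes that $v_{|\sigma w|-1}(k) = v_{|w|-1}(k) + u_{|w|}(k)$.
Relation v) is immediate.

Therefore the submodule $\mathcal{R}$ of
$\mathbb{N}\langle\langle \Sigma \rangle\rangle$ finitely generated by the
series $T_k$'s, $U_{l,k}$'s, $U'_{l,k}$'s, $V_{l,k}$'s, $W_{k,\sigma}$'s is
stable for the operation $T \mapsto \sigma^{-1}T$, $\sigma \in \Sigma$. By
associativity of the operation $T \mapsto w^{-1}T$, this module is stable.
By [1, Prop. 1, p. 18], the series of $\mathcal{R}$ are recognizable.

To conclude the proof, notice that

$$T_k + V_{k,k} = \sum_{w \in L_k \setminus \{\varepsilon\}} \mathrm{val}_k(w)\, w = \sum_{w \in L_k} \mathrm{val}_k(w)\, w.$$

Indeed, if $\varepsilon \in L_k$ then $\mathrm{val}_k(\varepsilon) = 0$. □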

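Example 1 We consider the numeration system $S = (a^*b^*, \{a,b\}, a < b)$.
We obtain a linear representation $(\lambda, \mu, \gamma)$ for $T_S$:

$$\lambda = \begin{pmatrix} 1 & 0 & 0 \end{pmatrix}, \quad
\mu(a) = \begin{pmatrix} 1 & 1 & 0 \\ 0 & 1 & 1 \\ 0 & 0 & 1 \end{pmatrix}, \quad
\mu(b) = \begin{pmatrix} 1 & 1 & 1 \\ 0 & 1 & 1 \\ 0 & 0 & 1 \end{pmatrix}, \quad
\gamma = \begin{pmatrix} 0 \\ 1 \\ 1 \end{pmatrix}$$

where $\mu : \{a,b\}^* \to \mathbb{N}^{3 \times 3}$ is a morphism of monoids.
Thus one has

$$\mathrm{val}_S(w) = \lambda\, \mu(w)\, \gamma.$$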
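
The identity $\mathrm{val}_S(w) = \lambda\, \mu(w)\, \gamma$ is easy to test numerically. The sketch below hard-codes the matrices as we read them in Example 1 ($\lambda = (1\ 0\ 0)$, upper triangular $\mu(a)$ and $\mu(b)$, $\gamma = (0, 1, 1)^T$; the second rows are an assumption, since they are partly illegible in our copy) and compares the product with the rank of each word of $a^*b^*$. Only words of the language are tested.

```python
from itertools import product

LAMBDA = [1, 0, 0]
MU = {
    "a": [[1, 1, 0], [0, 1, 1], [0, 0, 1]],
    "b": [[1, 1, 1], [0, 1, 1], [0, 0, 1]],
}
GAMMA = [0, 1, 1]


def row_times_matrix(row, matrix):
    """Product of a row vector of size 3 with a 3x3 matrix."""
    return [sum(row[i] * matrix[i][j] for i in range(3)) for j in range(3)]


def series_value(word):
    """Compute lambda * mu(word) * gamma, with mu extended as a morphism."""
    row = LAMBDA
    for letter in word:
        row = row_times_matrix(row, MU[letter])
    return sum(x * y for x, y in zip(row, GAMMA))


def language_words(max_len):
    """Words of a*b* up to max_len in genealogical order (a < b)."""
    return ["".join(t) for n in range(max_len + 1)
            for t in product("ab", repeat=n) if "ba" not in "".join(t)]


words = language_words(8)
# The rank of each word in the enumeration is its numerical value.
assert all(series_value(w) == n for n, w in enumerate(words))
print([(w, series_value(w)) for w in words[:6]])
```

Reading a letter updates the row vector $(1, x, y)$ to $(1, x+1, x+y)$ for $a$ and to $(1, x+1, x+y+1)$ for $b$, so the value grows by the current length plus one, plus one more for each $b$.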
Inspired by the definition of $U$-automata given in [?], we have the
following characterization of the regular subsets of a regular language.

Lemma 5 Let $L \subseteq \Sigma^*$ be a regular language and
$M_L = (Q_L, s_L, F_L, \Sigma, \delta_L)$ be its minimal automaton. If
$M_K = (Q_K, s_K, F_K, \Sigma, \delta_K)$ is the minimal automaton of a regular
language $K \subseteq L$ then there exists a morphism $h$ of automata between
$M_K$ and $M_L$ defined as follows

$$h : Q_K \to Q_L, \qquad
\begin{cases}
h(\delta_K(q, \sigma)) = \delta_L(h(q), \sigma), & \sigma \in \Sigma,\ q \in Q_K, \\
h(s_K) = s_L, \\
h(F_K) \subseteq F_L.
\end{cases}$$

Proof. A state of $M_K$ is a derivative of $K$ of the form

$$u^{-1}.K = \{ v : uv \in K \}.$$

Since $K \subseteq L$, $u^{-1}.K \subseteq u^{-1}.L$. We consider the map
$h : Q_K \to Q_L$ defined by $h(q) = u^{-1}.L$ if $q = u^{-1}.K$ for some $u$.
We can verify the properties of $h$ using the definition of the minimal
automaton [?, III.5].

1. $\delta_K(q, \sigma) = \sigma^{-1}.q = \sigma^{-1}.(u^{-1}.K)$ for some
$u \in \Sigma^*$, $q \in Q_K$, $\sigma \in \Sigma$. So
$\delta_K(q, \sigma) = (u\sigma)^{-1}.K$ and
$(u\sigma)^{-1}.L = \sigma^{-1}.h(q) = \delta_L(h(q), \sigma)$.

2. $s_K = \varepsilon^{-1}.K$ and $s_L = \varepsilon^{-1}.L$.

3. A state $q = u^{-1}.K$ belongs to $F_K$ if $\varepsilon \in u^{-1}.K$;
therefore $\varepsilon \in u^{-1}.L$ and $h(q) = u^{-1}.L \in F_L$.

□

With this lemma, we can generalize Proposition 4 and obtain a
characterization of the $S$-recognizable sets.

Theorem 6 Let $S = (L, \Sigma, <)$ be a numeration system. A set
$X \subseteq \mathbb{N}$ is $S$-recognizable if and only if the formal series

$$\sum_{w \in r_S(X)} \mathrm{val}_S(w)\, w \in \mathbb{N}\langle\langle \Sigma \rangle\rangle$$

is recognizable.
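Proof. The condition is sufficient. The support of a recognizable series
belonging to $\mathbb{N}\langle\langle \Sigma \rangle\rangle$ is a regular
language [1, Lemme 2, p. 49].

The condition is necessary. By Lemma 5, one has a morphism
$h : M_X \to M_L$ where $M_X$ (resp. $M_L$) is the minimal automaton of
$r_S(X)$ (resp. $L$). We proceed as in the proof of Proposition 4. Let $K$ be
the set of states of $M_X$; for $k, l \in K$, $\sigma \in \Sigma$, we
introduce the following series

$$T_k = \sum_{w \in L_k \setminus \{\varepsilon\}} [\mathrm{val}_{h(k)}(w) - v_{|w|-1}(h(k))]\, w,$$
$$U_{l,k} = \sum_{w \in L_l \setminus \{\varepsilon\}} u_{|w|}(h(k))\, w, \qquad U'_{l,k} = \sum_{w \in L_l} u_{|w|}(h(k))\, w,$$
$$V_{l,k} = \sum_{w \in L_l \setminus \{\varepsilon\}} v_{|w|-1}(h(k))\, w,$$
$$W_{k,\sigma} = \begin{cases} [\mathrm{val}_{h(k)}(\sigma) - v_0(h(k))]\, \varepsilon & \text{if } \sigma \in L_k \\ 0 & \text{otherwise.} \end{cases}$$

We conclude as in Proposition 4. □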

In [?], it is shown that for any numeration system $S$, arithmetic
progressions are always $S$-recognizable. Using formal series, we can obtain a
generalization of this result. Here, the language $L$ is not necessarily
lexicographically ordered.

Proposition 7 Let $L \subseteq \Sigma^*$ be an infinite regular language and
$\alpha : L \to \mathbb{N}$ be a one-to-one correspondence. If

$$T = \sum_{w \in L} \alpha(w)\, w \in \mathbb{N}\langle\langle \Sigma \rangle\rangle$$

is recognizable then $\alpha^{-1}(p + \mathbb{N} q)$ is a regular language.

Proof. Assume $p = 0$. Consider the congruence of the semiring
$(\mathbb{N}, +, \cdot, 0, 1)$ defined by $n \sim n + q$. We denote by
$\mathcal{N}$ the finite semiring $\mathbb{N}/\!\sim$ and by $\varphi$ the
canonical morphism $\varphi : \mathbb{N} \to \mathcal{N}$. The characteristic
series of $L$, $\underline{L} = \sum_{w \in L} w$, is recognizable (see
[1, Prop. 1, p. 51]). So

$$U = \varphi(T + \underline{L}) = \sum_{w \in L} \varphi(\alpha(w) + 1)\, w \in \mathcal{N}\langle\langle \Sigma \rangle\rangle$$

is rational (see [1, Lemme 1, p. 49]). Since $\mathcal{N}$ is finite and $U$ is
rational, the set

$$U^{-1}(\{\varphi(1)\}) = \{ w \in \Sigma^* : (U, w) = \varphi(1) \} = \alpha^{-1}(\mathbb{N} q)$$

is a regular language (see [1, Prop. 2, p. 52]).

If $p \neq 0$ and $p < q$, then consider the series $U = \varphi(T)$ and the set
$U^{-1}(\{\varphi(p)\})$. □

Corollary 8 Arithmetic progressions are $S$-recognizable for any numeration
system $S$.

Proof. This is a direct consequence of Propositions 4 and 7. □

Remark 1 One can easily characterize the congruences $\sim$ of the semiring
$(\mathbb{N}, +, \cdot, 0, 1)$ with finite index $q > 1$. The canonical
morphism is denoted by $\varphi$.

First notice that $\varphi(0) \neq \varphi(1)$. Since $\mathbb{N}/\!\sim$ is
finite, there exist $x, y \in \mathbb{N}$ such that $x + y \sim x$. Let

$$y_0 = \min\{ y > 0 \mid \exists x : x \sim x + y \} \quad \text{and} \quad x_0 = \min\{ x \mid x \sim x + y_0 \}.$$

For all $n \in \mathbb{N}$ and $i = 0, \ldots, y_0 - 1$, one has
$x_0 + i \sim x_0 + i + n y_0$. It is obvious that if $y_0 > 1$ then for
$i, j \in \{0, \ldots, y_0 - 1\}$, $i \neq j$, one has
$x_0 + i \not\sim x_0 + j$. By definition of $x_0$ and $y_0$, if $z < x_0$ then
$\varphi^{-1}\varphi(z) = \{z\}$.

Therefore the congruences of $\mathbb{N}$ with finite index are generated by
the relation $n \sim n + y_0$ for $n$ sufficiently large. So we cannot refine
Proposition 7 with the same kind of proof because it uses explicitly the
finiteness of $\mathbb{N}/\!\sim$.

4 Multiplication for exact polynomial languages 

In [?], we proved that for the numeration system
$S = (a^*b^*, \{a,b\}, a < b)$, multiplication by a non-negative integer
$\lambda$ transforms the $S$-recognizable sets into $S$-recognizable sets if
and only if $\lambda$ is a perfect square.

In this section, we study the family of regular languages with polynomial 
complexity function. This step contains the main ideas leading to the case of 
an arbitrary polynomial language (i.e. a language with complexity function 
bounded by a polynomial). But it is simpler to handle since we only deal 
with polynomials. 
Lemma 9 Let $f : \mathbb{N} \to \mathbb{N}$ be a strictly increasing function
such that $f(\mathbb{N})$ is a finite union of arithmetic progressions (i.e.
there exist $y_0$ and $\Gamma$ such that
$\forall y \geq y_0$, $y \in f(\mathbb{N}) \Leftrightarrow y + \Gamma \in f(\mathbb{N})$).

Let $k = f^{-1}(y_0 + \Gamma) - f^{-1}(y_0)$. For all $x \geq f^{-1}(y_0)$,
$n \in \mathbb{N}$,

$$f(x + nk) = f(x) + n\Gamma.$$

Proof. Let $x_0 = f^{-1}(y_0)$. We have by definition of $k$,

$$f(x_0 + k) = f(f^{-1}(y_0) + k) = f(f^{-1}(y_0 + \Gamma)) = f(x_0) + \Gamma.$$

It is sufficient to show that if $x \geq x_0$ then

$$f(x + k) = f(x) + \Gamma \Rightarrow f(x + k + 1) = f(x + 1) + \Gamma.$$

Since $f$ is strictly increasing, $f(x + k + 1) > f(x + k) = f(x) + \Gamma$.
Since the characteristic sequence of $f(\mathbb{N})$ is ultimately periodic,
there exists $\nu > x_0$ such that $f(\nu) = f(x + k + 1) - \Gamma > f(x)$.
Then $\nu \geq x + 1$. There exists $\mu \in \mathbb{N}$ such that
$f(\mu) = f(x + 1) + \Gamma > f(x) + \Gamma = f(x + k)$.
Now, assume that $\nu > x + 1$. Therefore $f(\nu) > f(x + 1)$ and

$$f(x + k + 1) = f(\nu) + \Gamma > f(x + 1) + \Gamma = f(\mu) > f(x + k).$$

So we have $x + k + 1 > \mu > x + k$, which is a contradiction, and
$\nu = x + 1$.
□
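
To make the statement of Lemma 9 concrete, the following sketch checks it on one hand-made function. The sample image set (two irregular initial values, then the integers congruent to 1 or 3 modulo 5 from 6 onwards), the threshold `Y0 = 6` and the period `GAMMA = 5` are our own illustrative choices, not taken from the paper.

```python
def image(limit):
    """Increasing list of values of a sample f: irregular start, then
    the residues 1 and 3 modulo 5 from 6 onwards."""
    return [1, 4] + [y for y in range(6, limit) if y % 5 in (1, 3)]


VALUES = image(500)
Y0, GAMMA = 6, 5  # threshold and period of the image set f(N)


def f(x):
    """The strictly increasing function enumerating VALUES."""
    return VALUES[x]


def f_inv(y):
    """Index of y in the image of f (y must be a value of f)."""
    return VALUES.index(y)


k = f_inv(Y0 + GAMMA) - f_inv(Y0)  # number of values of f in one period
x0 = f_inv(Y0)

# Conclusion of Lemma 9: shifting the argument by n*k shifts the value by n*GAMMA.
for x in range(x0, 40):
    for n in range(20):
        assert f(x + n * k) == f(x) + n * GAMMA
print("k =", k)
```

Here two values of `f` fall in each period of length 5, so `k` equals 2.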

Definition 10 The complexity function of a language $L \subseteq \Sigma^*$ is

$$\rho_L : \mathbb{N} \to \mathbb{N} : n \mapsto \#(L \cap \Sigma^n).$$

In the following, we assume that we deal with "true" complexity functions,
i.e. if $\rho_L$ is a polynomial belonging to $\mathbb{Q}[x]$ and
$n \in \mathbb{N}$ then $\rho_L(n)$ is a non-negative integer. We equally use
the notation $\rho_L(n)$, $u_n(s)$ or even $u_n$ provided the context is clear.

The next lemma will be useful when applied to a complexity function. 

Lemma 11 If $H$ is a polynomial such that
$\forall n \in \mathbb{N} \setminus \{0\}$, $H(n) \in \mathbb{Z}$ then
$H(\mathbb{Z}) \subseteq \mathbb{Z}$.

Proof. We proceed by induction on the degree of $H$. If $H$ is a polynomial
of degree one then one has $H(n) = an + b$ with $a, b \in \mathbb{Z}$ and
$H(\mathbb{Z}) \subseteq \mathbb{Z}$.

Assume that the result holds for polynomials of degree $k \geq 1$. If $H$ is
a polynomial of degree $k + 1$, then there exists a polynomial $R$ of degree
$k$ such that $\forall n \geq 1$, $R(n) = H(n + 1) - H(n) \in \mathbb{Z}$.
Therefore $R(\mathbb{Z}) \subseteq \mathbb{Z}$ and
$H(0) = H(1) - R(0) \in \mathbb{Z}$. We can conclude by induction on $n < 0$
because $H(n) = H(n + 1) - R(n)$. □
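
A small numerical illustration of Lemma 11 and of the difference polynomial $R$ used in its proof. The polynomial $H(x) = x(x+1)(2x+1)/6$ is our own example of a polynomial with non-integer coefficients taking integer values at the positive integers; exact rational arithmetic avoids any rounding issue.

```python
from fractions import Fraction


def H(x):
    """H(x) = x(x+1)(2x+1)/6, a polynomial with non-integer coefficients."""
    x = Fraction(x)
    return x * (x + 1) * (2 * x + 1) / 6


def R(x):
    """First difference R(x) = H(x+1) - H(x), which equals (x+1)^2."""
    return H(x + 1) - H(x)


# Hypothesis of the lemma: integer values at the positive integers.
assert all(H(n).denominator == 1 for n in range(1, 50))
# The difference polynomial has lower degree and is integer valued everywhere.
assert all(R(n) == (n + 1) ** 2 for n in range(-50, 50))
# Conclusion of the lemma: integer values at zero and at negative integers too.
assert all(H(n).denominator == 1 for n in range(-50, 1))
print(H(-2), H(0), H(3))
```

The backward step $H(n) = H(n+1) - R(n)$ is exactly how the proof extends integrality from the positive integers to all of $\mathbb{Z}$.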

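Theorem 12 Let $L \subseteq \Sigma^*$ be a regular language such that

$$\rho_L(n) = \begin{cases} a_l n^l + \cdots + a_1 n + a_0 & \text{if } n > 0 \\ 1 & \text{otherwise} \end{cases}$$

where the $a_i$'s belong to $\mathbb{Q}$ and $a_l > 0$. Let $\prec$ be an
ordering of the alphabet $\Sigma$ and $S = (L, \Sigma, \prec)$ be the
corresponding numeration system.

If $\lambda \in \mathbb{N} \setminus \{n^{l+1} : n \in \mathbb{N}\}$, then
there exists a subset $X$ of $\mathbb{N}$ such that $r_S(X)$ is regular and
$r_S(\lambda X)$ is not.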

Proof. One can build a polynomial $P \in \mathbb{Q}[x]$ of degree $l + 1$ such
that $P(0) = 0$ and for all $n \geq 1$, $P(n + 1) = P(n) + \rho_L(n)$.

Indeed, let $P(x) = b_{l+1} x^{l+1} + \cdots + b_1 x + b_0$. The conditions on
$P$ give the following triangular system

$$\begin{cases}
a_l = b_{l+1} (l + 1) \\
a_{l-1} = b_{l+1} \binom{l+1}{2} + b_l\, l \\
\quad \vdots \\
a_0 = b_{l+1} + \cdots + b_1 \\
b_0 = 0.
\end{cases}$$

This polynomial $P$ has some useful properties. We have the polynomial
identity $P(x + 1) = P(x) + \rho_L(x)$ for $x \in \mathbb{N} \setminus \{0\}$.
Then it holds for $x \in \mathbb{R}$ if we extend the definition of $\rho_L$ to
$\rho_L : \mathbb{R} \to \mathbb{R} : x \mapsto a_l x^l + \cdots + a_0$. By
Lemma 11, $P(1) = \rho_L(0) = a_0 \in \mathbb{Z}$. One shows by induction on
$n \in \mathbb{N}$ that $P(n)$ (resp. $P(-n)$) is an integer since
$\rho_L(\mathbb{N}) \subseteq \mathbb{N}$ (resp. since
$\rho_L(\mathbb{Z}) \subseteq \mathbb{Z}$ by Lemma 11).

Let $x \in \mathbb{N} \setminus \{0\}$; notice that

$$|r_S(x)| = n \Leftrightarrow x \in [P(n) - a_0 + 1, P(n + 1) - a_0]. \qquad (1)$$

Indeed, an integer $x$ has a representation of length $n$ if
$v_{n-1} \leq x < v_n$ and

$$v_n = \sum_{i=1}^{n} \rho_L(i) + 1 = \sum_{i=1}^{n} [P(i + 1) - P(i)] + 1 = P(n + 1) - P(1) + 1.$$

Notice that $r_S(P(\mathbb{N}))$ is a translation of the set $X(L, <)$ of the
first words of each length. Therefore $X = P(\mathbb{N})$ is
$S$-recognizable, see [13].
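
Relation (1) can be tested on the language $a^*b^*$, whose complexity is $\rho_L(n) = n + 1$, so that $l = 1$ and $a_0 = 1$. Under these assumptions the polynomial with $P(0) = 0$ and $P(n+1) = P(n) + \rho_L(n)$ is $P(n) = n(n+1)/2$. The sketch below, with helper names of our own, checks that the length of the representation of $x$ is $n$ exactly when $P(n) - a_0 + 1 \leq x \leq P(n+1) - a_0$.

```python
from itertools import product


def P(n):
    """P(n) = n(n+1)/2 satisfies P(0) = 0 and P(n+1) = P(n) + (n+1)."""
    return n * (n + 1) // 2


A0 = 1  # constant coefficient of rho_L(n) = n + 1 for L = a*b*


def words(max_len):
    """Words of a*b* in genealogical order (a < b), up to length max_len."""
    return ["".join(t) for n in range(max_len + 1)
            for t in product("ab", repeat=n) if "ba" not in "".join(t)]


WORDS = words(12)

# Relation (1): x >= 1 has a representation of length n iff x lies in the interval.
for x, w in enumerate(WORDS):
    n = len(w)
    if x >= 1:
        assert P(n) - A0 + 1 <= x <= P(n + 1) - A0

# For this language, P(n) is the value of the first word of length n, namely a^n.
assert all(WORDS[P(n)] == "a" * n for n in range(12))
print([(n, P(n), WORDS[P(n)]) for n in range(5)])
```

For this particular language the translation mentioned in the text is trivial, since $P(n) - a_0 + 1 = P(n)$.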

Let $\lambda \in \mathbb{N} \setminus \{0, 1\}$. Our aim is to show that
$\lambda P(\mathbb{N})$ is not $S$-recognizable.

For $n$ large enough, we first show that

$$n < |r_S(\lambda P(n))| < \lambda^{1/l} n.$$

The first inequality is obvious. In view of (1), to satisfy the second
inequality, one must check whether

$$\lambda P(n) < P(\lambda^{1/l} n) - a_0 + 1.$$

We can write $P(n)$ as $b_{l+1} n^{l+1} + Q(n)$ with $b_{l+1} > 0$ and $Q$ being
a polynomial of degree not exceeding $l$. Then,

$$P(\lambda^{1/l} n) - \lambda P(n) - a_0 + 1
= b_{l+1} (\lambda^{1/l} n)^{l+1} - \lambda b_{l+1} n^{l+1} + Q(\lambda^{1/l} n) - \lambda Q(n) - a_0 + 1.$$

The coefficient of $n^{l+1}$ is $b_{l+1} (\lambda^{(l+1)/l} - \lambda) > 0$. So,
there exists $n_0$ such that for all $n \geq n_0$, this polynomial expression
of degree $l + 1$ is strictly positive and
$|r_S(\lambda P(n))| < \lambda^{1/l} n$.

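If $n$ is sufficiently large, we show that

$$|r_S(\lambda P(n + 1))| > |r_S(\lambda P(n))|.$$

Let $i = |r_S(\lambda P(n))|$. In view of (1), one has to verify that

$$\lambda P(n + 1) > P(i + 1) - a_0.$$

By definition of $P$ and by (1), one has

$$\lambda P(n + 1) = \lambda P(n) + \lambda \rho_L(n) > P(i) - a_0 + \lambda \rho_L(n).$$

Therefore it is sufficient to check whether
$P(i) - a_0 + \lambda \rho_L(n) \geq P(i + 1) - a_0$,
which occurs if and only if

$$\lambda \rho_L(n) - \rho_L(i) = a_l (\lambda n^l - i^l) + \cdots + a_k (\lambda n^k - i^k) + \cdots + a_0 (\lambda - 1) \geq 0.$$

To verify that this inequality holds, remember that $a_l > 0$ and for
$n \geq n_0$, $1 < i/n < \lambda^{1/l}$. Thus one studies the quotient, when
$n \to +\infty$,

$$\frac{\lambda \rho_L(n) - \rho_L(i)}{n^l}
= a_l \underbrace{\left( \lambda - \left( \tfrac{i}{n} \right)^l \right)}_{> 0}
+ \cdots + \frac{a_k}{n^{l-k}} \underbrace{\left( \lambda - \left( \tfrac{i}{n} \right)^k \right)}_{\text{is bounded}}
+ \cdots + \underbrace{\frac{a_0}{n^l} (\lambda - 1)}_{\to 0}.$$

So there exists $n_0' \geq n_0$ such that for all $n \geq n_0'$,
$|r_S(\lambda P(n + 1))| > |r_S(\lambda P(n))|$.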

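Assume that $r_S(\lambda P(\mathbb{N}))$ is regular; then the set
$|r_S(\lambda P(\mathbb{N}))|$ is a finite union of arithmetic progressions.
We may apply Lemma 9; indeed, the function $|r_S(\lambda P(\cdot))|$ is
strictly increasing in $\{n : n \geq n_0'\}$ and there exist $l_0$ and
$\Gamma_\lambda$ (simply written $\Gamma$) such that $\forall l \geq l_0$,
$l \in |r_S(\lambda P(\mathbb{N}))| \Leftrightarrow l + \Gamma \in |r_S(\lambda P(\mathbb{N}))|$.
Let $n_1 \geq n_0'$ be such that $|r_S(\lambda P(n_1))| \geq l_0$. By Lemma 9,
there exists $k_\lambda$ (simply written $k$) such that for all $n \geq n_1$
and for all $\alpha \in \mathbb{N}$,

$$|r_S(\lambda P(n + \alpha k))| = |r_S(\lambda P(n))| + \alpha \Gamma.$$

Let $i = |r_S(\lambda P(n))|$. In view of (1), one has

$$P(i + \alpha \Gamma) - a_0 + 1 \leq \lambda P(n + \alpha k) \leq P(i + \alpha \Gamma + 1) - a_0.$$

Since $\lambda P(n + \alpha k) - P(i + \alpha \Gamma) + a_0 - 1$ must be
positive for all $\alpha \in \mathbb{N}$, the coefficient of the greatest
power of $\alpha$, $\alpha^{l+1}$, must be strictly positive. This
coefficient is

$$b_{l+1} (\lambda k^{l+1} - \Gamma^{l+1})$$

and we have the condition

$$k > \frac{\Gamma}{\lambda^{1/(l+1)}}.$$

Notice that the coefficient vanishes only if $\lambda = (\Gamma / k)^{l+1}$.
By hypothesis, this case is excluded (notice that
$\Gamma / k \in \mathbb{Q} \setminus \mathbb{N} \Rightarrow (\Gamma / k)^{l+1} \notin \mathbb{N}$).

But $\lambda P(n + \alpha k) - P(i + \alpha \Gamma + 1) + a_0$ must be negative
for all $\alpha \in \mathbb{N}$. The coefficient of the greatest power of
$\alpha$ is also $b_{l+1} (\lambda k^{l+1} - \Gamma^{l+1})$ and must be
strictly negative. Then we have simultaneously the condition

$$k < \frac{\Gamma}{\lambda^{1/(l+1)}},$$

which leads to a contradiction. □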



In Theorem 12, we exhibit a recognizable set $X = P(\mathbb{N})$ such that
$|r_S(\lambda P(\mathbb{N}))|$ is not a finite union of arithmetic
progressions. When we consider the case $\lambda = \beta^{l+1}$,
$\beta \in \mathbb{N} \setminus \{0, 1\}$, we cannot easily find a subset $X$
which is recognizable and such that $\lambda X$ is not.

The next proposition shows that $|r_S(\beta^{l+1} P(\mathbb{N}))|$ is a finite
union of arithmetic progressions whenever $\rho_L$ is a polynomial of
degree $l$.
Proposition 13 With the assumptions and notations of Theorem 12, there
exists $C \in \mathbb{Z}$ such that for $n$ large enough,

$$|r_S(\beta^{l+1} P(n))| = \beta n + C.$$

Proof. In the proof of Theorem 12, we introduced a polynomial
$P(x) = b_{l+1} x^{l+1} + \cdots + b_1 x$ such that
$P(n + 1) - P(n) = \rho_L(n)$. In view of (1), we have to find an integer $C$
such that for $n$ large enough

$$P(\beta n + C + 1) - a_0 - \beta^{l+1} P(n) \geq 0 \qquad (2)$$
$$\beta^{l+1} P(n) - P(\beta n + C) + a_0 - 1 \geq 0. \qquad (3)$$

The coefficient of $n^{l+1}$ vanishes in (2) and (3). The coefficient of $n^l$
in (2) is $\beta^l [a_l (C + 1) + b_l (1 - \beta)]$ with
$a_l = b_{l+1} (l + 1)$. It is strictly increasing with $C$ and equals zero for

$$C = C_1 := \frac{b_l (\beta - 1)}{a_l} - 1.$$

The same coefficient in (3) is $-\beta^l [a_l C + b_l (1 - \beta)]$. It is
strictly decreasing with $C$ and equals zero for $C = C_2 := C_1 + 1$.

If $C_1$ and $C_2$ are not integers then there exists
$C \in\, ]C_1, C_2[\, \cap\, \mathbb{Z}$ such that the coefficients of the terms
of maximal degree are both strictly positive.

Otherwise, one has to consider the integer case $C = C_1$ or $C = C_2$ (it is
obvious that any other $C$ leads to a strictly negative expression for (2) or
(3)). Moreover, if $C = C_1$ (resp. $C = C_2$) then (3) (resp. (2)) is
satisfied for $n$ large enough.

Notice that for $i = 1, \ldots, l - 1$ the coefficient of $n^i$ in (2) with
$C = C_1$ is the opposite of the coefficient of $n^i$ in (3) with $C = C_2$
since $C_2 = C_1 + 1$. Notice also that the independent term in (2) for
$C = C_1$ is $P(C_2) - a_0$. In (3) for $C = C_2$ this term is
$-P(C_2) + a_0 - 1$. Thus we can write (2) with $C = C_1$ as

$$A_{l-1} n^{l-1} + \cdots + A_1 n + P(C_2) - a_0$$

and (3) with $C = C_2$ as

$$-A_{l-1} n^{l-1} - \cdots - A_1 n - P(C_2) + a_0 - 1.$$

If there exists $i$ such that $A_i \neq 0$ then let
$j = \max_{A_i \neq 0} i$. If $A_j > 0$ (resp. $A_j < 0$) then one takes
$C = C_1$ (resp. $C = C_2$).

Now, assume that $A_i = 0$ for $i = 1, \ldots, l - 1$. If
$P(C_2) - a_0 \geq 0$ then one takes $C = C_1$. Otherwise, $-P(C_2) + a_0$ is
a strictly positive integer (remember the properties of $P$ obtained in the
proof of Theorem 12). Therefore $-P(C_2) + a_0 - 1 \geq 0$ and one takes
$C = C_2$. □
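
The contrast between Theorem 12 and Proposition 13 can be observed numerically on $a^*b^*$, where $l = 1$ and, under the assumptions $\rho_L(n) = n + 1$ and $P(n) = n(n+1)/2$, relation (1) says that the representation of $x$ has length $m$ exactly when $P(m) \leq x < P(m+1)$. The closed form used in `rep_length` is our own shortcut for this language only.

```python
from math import isqrt


def P(n):
    """P(n) = n(n+1)/2, the value of the first word of length n in a*b*."""
    return n * (n + 1) // 2


def rep_length(x):
    """Length of the representation of x in a*b*: the largest m with P(m) <= x."""
    return (isqrt(8 * x + 1) - 1) // 2


def lengths(lam, n_max):
    """Lengths of the representations of lam * P(n) for n = 1, ..., n_max - 1."""
    return [rep_length(lam * P(n)) for n in range(1, n_max)]


# lambda = 4 = 2^2: the lengths follow beta*n + C exactly (here beta = 2).
offsets = {m - 2 * n for n, m in enumerate(lengths(4, 500), start=1)}

# lambda = 2 is not a square: consecutive lengths differ by 1 or by 2.
l2 = lengths(2, 500)
steps = {b - a for a, b in zip(l2, l2[1:])}
print(offsets, steps)
```

A single offset for $\lambda = 4$ is what Proposition 13 predicts, while the mixture of steps for $\lambda = 2$ is only suggestive: the code does not prove that the set of lengths fails to be ultimately periodic, which is the content of Theorem 12.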
5 Multiplication and polynomial languages

Here we obtain the generalization of Theorem 12 for an arbitrary regular
language of polynomial complexity. At the same time, we show that the
sequence $(v_n / n^{l+1})_{n \in \mathbb{N}}$ converges if the complexity of
$L$ is $\Theta(n^l)$.

Let us recall some notations. Let $f(n)$ and $g(n)$ be two functions. It
is said that $f(n)$ is $O(g(n))$ if there exist positive constants $c$ and
$n_0$ such that for all $n \geq n_0$, $f(n) \leq c\, g(n)$; $f(n)$ is
$\Omega(g(n))$ if there exist a strictly positive constant $c$ and an
infinite sequence $n_0, n_1, \ldots, n_i, \ldots$ such that for all
$i \in \mathbb{N}$, $f(n_i) \geq c\, g(n_i)$. The function $f(n)$ is
$\Theta(g(n))$ if $f(n)$ is $O(g(n))$ and $\Omega(g(n))$.

Let $x \in A^*$ and $y \in B^*$, with $A$ and $B$ two finite alphabets.
If $|x| = |y| + i$, $i \in \mathbb{N}$, then $(x, y)^\# = (x, \#^i y)$ where
$\#$ is a new symbol which does not belong to $A \cup B$. If $|y| = |x| + i$
then $(x, y)^\# = (\#^i x, y)$. This operation can be extended to $n$-tuples
of words. Let $R$ be a relation over $A^* \times B^*$. We say that $R$ is
regular if $R^\#$ is a regular language. This definition can be extended to
$n$-ary relations. A map is regular if its graph is regular.
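
The padding operation is the only ingredient needed to turn a pair of words into a single word over an alphabet of pairs. A minimal sketch, writing the padding symbol as `#` (our choice of notation) and padding on the left as in the definition above:

```python
PAD = "#"  # padding symbol, assumed not to belong to either alphabet


def pad_pair(x: str, y: str):
    """Left-pad the shorter word with PAD so that both have the same length."""
    width = max(len(x), len(y))
    return x.rjust(width, PAD), y.rjust(width, PAD)


def as_word_over_pairs(x: str, y: str):
    """View the padded pair as a single word whose letters are pairs of symbols."""
    px, py = pad_pair(x, y)
    return list(zip(px, py))


print(pad_pair("ab", "aabb"))          # ('##ab', 'aabb')
print(as_word_over_pairs("b", "ab"))   # [('#', 'a'), ('b', 'b')]
```

A relation is then regular when the set of such words over pairs is accepted by a finite automaton reading both components synchronously.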

Theorem 14 Let $L \subseteq \Sigma^*$ be a regular language such that
$\rho_L(n)$ is $\Theta(n^l)$ for some integer $l$. If
$\lambda \in \mathbb{N} \setminus \{n^{l+1} : n \in \mathbb{N}\}$, then there
exists a subset $X$ of $\mathbb{N}$ such that $r_S(X)$ is regular and
$r_S(\lambda X)$ is not.

This theorem has a direct corollary. 

Corollary 15 Under the assumptions of Theorem 14, addition is not a
regular map (i.e. the graph of the map $(x, y) \mapsto x + y$ is not regular).

Proof. By Theorem 14, there exists a subset $X$ of $\mathbb{N}$ such that $X$
is $S$-recognizable and $2X$ is not. Assume that the graph of the addition

$$\mathcal{G} = \{ (r_S(x), r_S(y), r_S(x + y))^\# : x, y \in \mathbb{N} \}$$

is regular. Let $p_3$ be the canonical homomorphism defined by
$p_3(x, y, z) = z$. It is clear that the set
$A = \{ (r_S(x), r_S(x), w)^\# : x \in X, w \in \Sigma^* \}$ is regular.
Therefore

$$A \cap \mathcal{G} = \{ (r_S(x), r_S(x), r_S(2x))^\# : x \in X \}$$

is regular. Thus $p_3(A \cap \mathcal{G}) = r_S(2X)$ is also regular, a
contradiction. □

In the following, we will use the notion of $k$-tiered word and the results
obtained in [14] about the complexity of regular polynomial languages.
The first lemma is just a refinement of [14, Lemma 1]. We simply remark
that one can consider an ultimately periodic sequence $n_i$ such that
$\rho_L(n_i) \geq b_0 n_i^l$.

Lemma 16 If $L$ is a regular language such that $\rho_L(n)$ is $\Theta(n^l)$
for some integer $l$ then there exist constants $b_0$ and $C$ and an infinite
sequence $n_0, n_1, \ldots, n_i, \ldots$ such that for all $i \in \mathbb{N}$,
$\rho_L(n_i) \geq b_0 n_i^l$ and $n_{i+1} - n_i = C$.

Proof. It is obvious that there exists a word $w \in L$ which is
$(l + 1)$-tiered (see [14, Lemmas 2-4]),
$w = x\, y_1^{d_1} z_1 \cdots y_{l+1}^{d_{l+1}} z_{l+1}$. Let
$C = |y_1| \cdots |y_{l+1}|$. As shown in [14], there exists a constant $b_0$
such that the number of words of length
$n_t = |x z_1 \cdots z_{l+1}| + t\, C$ is greater than $b_0 n_t^l$ for any
integer $t$. □

Recall (see [?]) that the finite sum of integral powers is given by

$$\sum_{i=0}^{n} i^p = \frac{(n + B + 1)^{p+1} - B^{p+1}}{p + 1}$$

where all terms of the form $B^m$ are replaced with the corresponding
Bernoulli numbers $B_m$. This formula will be useful in the next lemma.
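
The symbolic formula can be unfolded with the binomial theorem: expanding $(n + 1 + B)^{p+1}$ and cancelling $B^{p+1}$ leaves $\frac{1}{p+1} \sum_{k=0}^{p} \binom{p+1}{k} B_k (n+1)^{p+1-k}$. The sketch below checks this against direct summation, assuming the sum runs over $i = 0, \ldots, n$ and the convention $B_1 = -1/2$; both are our reading of the formula.

```python
from fractions import Fraction
from math import comb


def bernoulli(m_max):
    """Bernoulli numbers B_0..B_m_max with the convention B_1 = -1/2."""
    B = [Fraction(1)]
    for m in range(1, m_max + 1):
        # Recurrence: sum_{k=0}^{m} C(m+1, k) B_k = 0 for m >= 1.
        B.append(-sum(comb(m + 1, k) * B[k] for k in range(m)) / (m + 1))
    return B


def power_sum(n, p):
    """Sum of i^p for i = 0..n via ((n + B + 1)^(p+1) - B^(p+1)) / (p + 1),
    where each power B^m is read as the Bernoulli number B_m."""
    B = bernoulli(p + 1)
    total = sum(comb(p + 1, k) * B[k] * (n + 1) ** (p + 1 - k)
                for k in range(p + 1))
    return total / (p + 1)


for p in range(1, 7):
    for n in range(0, 30):
        assert power_sum(n, p) == sum(i ** p for i in range(n + 1))
print(power_sum(10, 3))
```

In particular the sum is a polynomial of degree $p + 1$ in $n$ with leading coefficient $1/(p+1)$, which is the property used in the next lemma.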



Lemma 17 If $\rho_L(n)$ is $\Theta(n^l)$ then
$v_n = \sum_{i=0}^{n} \rho_L(i)$ is $\Theta(n^{l+1})$. Moreover, there exists a
constant $J$ such that $v_{n_i} \geq J n_i^{l+1}$ for the sequence
$n_0, n_1, \ldots, n_i, \ldots$ of Lemma 16.

Proof. i) There exist $N_0$ and a constant $b_1$ such that for all
$n \geq N_0$, $\rho_L(n) \leq b_1 n^l$. If one replaces $b_1$ by a bigger
constant then the latter inequality holds for all $n$. For $n$ sufficiently
large, there exists a constant $K$ such that

$$v_n = \sum_{i=0}^{n} \rho_L(i) \leq b_1 \sum_{i=0}^{n} i^l \leq K n^{l+1}.$$

ii) With the sequence $n_i$ of Lemma 16, one has

$$v_{n_i} = \sum_{j=0}^{n_i} \rho_L(j) \geq \sum_{j=0}^{i} \rho_L(n_j) \geq \sum_{j=0}^{i} b_0 n_j^l = b_0 \sum_{j=0}^{i} (n_0 + jC)^l.$$

Since $n_i = n_0 + iC$, $n_j$ is a linear function of $j$ and for $i$ large
enough, there exists a constant $J$ such that

$$v_{n_i} \geq J n_i^{l+1}.$$

□



So, at this stage, we have a sequence $n_i$ such that $n_i = n_0 + iC$ and
constants $b_0$, $b_1$, $K$ and $J$ such that
$\rho_L(n_i) \geq b_0 n_i^l$, $\rho_L(n) \leq b_1 n^l$,
$v_n \leq K n^{l+1}$ and $v_{n_i} \geq J n_i^{l+1}$ for $n$ and $i$
sufficiently large.

Before going further in the proof of Theorem 14, we give an interesting
result about the convergence of the sequence
$(v_n / n^{l+1})_{n \in \mathbb{N}}$ when $L$ is a polynomial language. A
remarkable fact is that the limit always exists, although this is generally
not the case for the sequence $(\rho_L(n) / n^l)_{n \in \mathbb{N}}$. Consider
for instance the language $W = a^*b^* \cap (\{a, b\}^2)^*$ of the words of
even length in $a^*b^*$. It is obvious that $\rho_W(2n) = 2n + 1$,
$\rho_W(2n + 1) = 0$ and $v_{2n} = v_{2n+1} = (n + 1)^2$.
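
The behaviour of the two sequences for $W$ can be tabulated directly. The sketch below counts from the definition of $W$ as the set of even-length words of $a^*b^*$ (so a length $n$ contributes $n + 1$ words when $n$ is even and none when $n$ is odd); the function names are ours.

```python
from fractions import Fraction


def rho_W(n):
    """Number of words of length n in W, the even-length words of a*b*."""
    return n + 1 if n % 2 == 0 else 0


def v_W(n):
    """Number of words of W of length at most n."""
    return sum(rho_W(i) for i in range(n + 1))


# Closed form for the cumulative counts.
assert all(v_W(2 * n) == v_W(2 * n + 1) == (n + 1) ** 2 for n in range(200))

# rho_W(n)/n oscillates between values near 1 and 0, while v_W(n)/n^2 settles.
for n in (10, 11, 1000, 1001):
    print(n, float(Fraction(rho_W(n), n)), float(Fraction(v_W(n), n * n)))
```

The ratio $v_n / n^2$ approaches $1/4$, in line with the convergence result proved below, whereas $\rho_W(n)/n$ has no limit.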

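Lemma 18 Let $\rho_1, \ldots, \rho_k$, $\theta_1, \ldots, \theta_k$,
$\phi_1, \ldots, \phi_k$ be real numbers such that for all $i \neq j$,
$\theta_i \neq \theta_j$ and for all $j$, $\rho_j \neq 0$. There exists
$\epsilon > 0$ such that

$$M_n = \left| \rho_1 e^{i(n\theta_1 + \phi_1)} + \cdots + \rho_k e^{i(n\theta_k + \phi_k)} \right| > \epsilon$$

for an infinite sequence of integers $n$.

Proof. Assume that for all $\epsilon > 0$, $M_n > \epsilon$ only for a finite
number of integers $n$. In other words, $M_n \to 0$. By successive
applications of the Bolzano-Weierstrass theorem, there exist complex numbers
$z_1, \ldots, z_k$ and a subsequence $k(n)$ such that

$$\rho_j e^{i(k(n)\theta_j + \phi_j)} \to z_j \quad \text{and} \quad |z_j| = |\rho_j| \neq 0.$$

Since $M_n \to 0$, then $\sum_{j=1}^{k} z_j = 0$. For $l = 0, \ldots, k - 1$,
one gets in the same manner

$$\sum_{j=1}^{k} e^{il\theta_j} z_j = 0.$$

Therefore one has

$$\begin{pmatrix}
1 & \cdots & 1 \\
e^{i\theta_1} & \cdots & e^{i\theta_k} \\
\vdots & & \vdots \\
e^{i(k-1)\theta_1} & \cdots & e^{i(k-1)\theta_k}
\end{pmatrix}
\begin{pmatrix} z_1 \\ z_2 \\ \vdots \\ z_k \end{pmatrix}
= \begin{pmatrix} 0 \\ 0 \\ \vdots \\ 0 \end{pmatrix}.$$

This equality leads to a contradiction since the Vandermonde determinant
does not vanish. □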

We are now able to prove the convergence of
$(v_n / n^{l+1})_{n \in \mathbb{N}}$. This result and its proof were
suggested by P. Lecomte.
Theorem 19 If $L$ is a regular language such that $\rho_L(n)$ is
$\Theta(n^l)$ then the sequence $(v_n / n^{l+1})_{n \in \mathbb{N}}$ converges
to a strictly positive limit. Moreover, 1 is a root of the characteristic
polynomial of the sequence $(\rho_L(n))_{n \in \mathbb{N}}$ with a
multiplicity equal to $l + 1$.

Proof. The sequence $(\rho_L(n))_{n \in \mathbb{N}}$ satisfies a recurrence
relation. Therefore, if $z_i$ is a root of multiplicity $\alpha_i$ of the
characteristic polynomial of $(\rho_L(n))_{n \in \mathbb{N}}$ then one can
write

$$\rho_L(n) = \sum_i P_i(n)\, z_i^n \qquad (4)$$

where $P_i(n)$ is a polynomial of degree less than $\alpha_i$. Moreover
$\rho_L(n)$ is $O(n^l)$; in other words, we have a constant $K$ such that

$$\left| \frac{\rho_L(n)}{n^l} \right| \leq K. \qquad (5)$$

This latter inequality has important consequences.

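i) We first show that $|z_i| > 1$ implies $P_i = 0$. Otherwise, let
$r = \sup_i |z_i|$ and $d$ the maximal degree of the polynomials $P_i$
corresponding to the different roots of modulus $r$. So we can write

$$\frac{\rho_L(n)}{n^d r^n} = c_1 e^{in\theta_1} + \cdots + c_k e^{in\theta_k} + R_n.$$

In the last expression, $R_n$ is made up of two sorts of terms, namely

$$\frac{P_i(n)\, z_i^n}{n^d r^n} \ \text{with } |z_i| < r \quad \text{and} \quad \frac{[h_d(P_i)](n)\, z_i^n}{n^d r^n} \ \text{with } |z_i| = r,$$

where

$$h_d : \mathbb{C}[z] \to \mathbb{C}[z] : P(z) \mapsto P(z) - \frac{D_z^d P(z)}{d!}\, z^d.$$

So $R_n \to 0$ if $n \to +\infty$. Therefore, by Lemma 18, there exists an
infinite sequence of integers such that

$$\left| \frac{\rho_L(n)}{n^d r^n} \right| > \epsilon - |R_n|.$$

For $n$ large enough, $|R_n| < \epsilon / 2$ and
$\left| \frac{\rho_L(n)}{n^l} \right| > \frac{\epsilon}{2}\, n^{d-l} r^n$ occurs
infinitely often, which contradicts (5).

ii) In the same way, one can verify that if $|z_i| = 1$ then the degree of the
corresponding polynomial $P_i$ cannot exceed $l$.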

iii) If we are interested in the behaviour of Vn/'nf'^^ when n — > +00, then 
in the expression (H), we simply focus on the terms of the form 



for i such that \zi\ = 1. Indeed, any other term in pL{n) provides Vn/n^~^^ 
with a term which converges to zero (all these terms are included in R^)- 
So, if we assume that 1 = has a multiplicty / + 1 and if zi = e*^^ 



,. . ., 



zt = e*''* are the other roots of modulus one with qj = j = 0, . . . , t 

and ^0 = 0; then one can write 



with 9j 7^ and 



Therefore, it is easy to see that 

t ^ n ^ n n 

^St = E « si^s E +» E + E «i ■ 

j=l k=0 k=0 k=0 

Moreover, we see that 1 necessarily has multiplicity $l + 1$; otherwise,
$v_n / n^{l+1} \to 0$, which is a contradiction with Lemma 17. □
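As a numerical illustration of Theorem 19, the following sketch takes the polynomial language $a^*b^*$ as an assumed example (it is not the one treated in the proof): here $\rho_L(n) = n + 1$, so $l = 1$. The function names `rho`, `v` and `ratio` are hypothetical helpers introduced only for this illustration.

```python
from fractions import Fraction


def rho(n: int) -> int:
    """Complexity of a*b*: the words of length n are a^i b^(n-i), i = 0..n."""
    return n + 1


def v(n: int) -> int:
    """v_n: number of words of length at most n."""
    return sum(rho(i) for i in range(n + 1))


def ratio(n: int, l: int = 1) -> Fraction:
    """v_n / n^(l+1), the quantity whose limit Theorem 19 describes."""
    return Fraction(v(n), n ** (l + 1))


if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        print(n, float(ratio(n)))
    # rho satisfies rho(n+2) - 2 rho(n+1) + rho(n) = 0, whose characteristic
    # polynomial (x - 1)^2 has the root 1 with multiplicity 2 = l + 1.
    assert all(rho(n + 2) - 2 * rho(n + 1) + rho(n) == 0 for n in range(50))
```

For this language $v_n = (n+1)(n+2)/2$, so the printed ratios approach $1/2$, a strictly positive limit as the theorem predicts.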



Proof of Theorem 14. By definition of a numeration system, it is clear
that for $n$ sufficiently large, $n + 1 \le |r_S(v_n)| \le n + C + 1$
since for $C$ consecutive values of $\rho_L(n)$ at least one of them does
not vanish. (Notice that if $\rho_L(n) > 0$ for all $n$, then
$|r_S(v_n)| = n + 1$.) Recall also that $|r_S(x)| = n$ iff
$v_{n-1} \le x < v_n$. In this proof, we use the sequence $n_i$ and the
constants $J$, $K$, $b_0$ and $b_1$ introduced in the previous
propositions.

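i) Assume that the integer constant $\lambda$ is strictly greater than
$(K/J)^l$. We show that for $n$ large enough,

$$n + 1 \le |r_S(\lambda v_n)| \le \lceil \lambda^{1/l} n \rceil + C - 1 < \lambda^{1/l} n + C. \qquad (6)$$

It is sufficient to show that
$\lambda v_n < v_{\lceil \lambda^{1/l} n \rceil + C - 1}$. By Lemma 17,
there exists
$k \in \{\lceil \lambda^{1/l} n \rceil, \ldots, \lceil \lambda^{1/l} n \rceil + C - 1\}$
such that $v_k \ge J k^{l+1}$. Moreover the function $n \mapsto v_n$ is
increasing. So,

$$v_{\lceil \lambda^{1/l} n \rceil + C - 1} \ge J \lceil \lambda^{1/l} n \rceil^{l+1} \ge J \lambda^{\frac{l+1}{l}} n^{l+1}.$$

Moreover, by Lemma 17, $\lambda v_n \le \lambda K n^{l+1}$. By the choice
of $\lambda$, it is clear that
$\lambda K n^{l+1} < J \lambda^{\frac{l+1}{l}} n^{l+1}$.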



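ii) In Lemma 16 and Lemma 17, we have introduced two constants $b_0$ and
$b_1$ such that $b_0 \le b_1$.

Let $s \in \mathbb{N} \setminus \{0\}$ be such that $s b_0 > b_1$. Here,
we show that the function

$$i \mapsto |r_S(\lambda v_{n_{si} - 1})|$$

is strictly increasing for $i$ sufficiently large. So, we have to show that

$$|r_S(\lambda v_{n_{s(i+1)} - 1})| = |r_S(\lambda v_{n_{si} + sC - 1})| > |r_S(\lambda v_{n_{si} - 1})|.$$

Let $k = |r_S(\lambda v_{n_{si} - 1})|$; then
$v_{k-1} \le \lambda v_{n_{si} - 1} < v_k$ and we must show that

$$\lambda v_{n_{si} + sC - 1} = \lambda v_{n_{si} - 1} + \lambda \sum_{j=0}^{sC-1} \rho_L(n_{si} + j) \ge v_k = v_{k-1} + \rho_L(k).$$

So, it is sufficient to show that
$\lambda \sum_{j=0}^{sC-1} \rho_L(n_{si} + j) \ge \rho_L(k)$. In view of
(6), $k < \lambda^{1/l} (n_{si} - 1) + C$. Therefore
$\rho_L(k) \le b_1 [\lambda^{1/l} (n_{si} - 1) + C]^l$. On the other hand,

$$\lambda \sum_{j=0}^{sC-1} \rho_L(n_{si} + j) \ge \lambda \sum_{j=0}^{s-1} \rho_L(n_{si} + jC) \ge \lambda\, b_0\, s\, n_{si}^l.$$

To conclude this part, notice that the coefficient of $n_{si}^l$ in
$b_1 [\lambda^{1/l} (n_{si} - 1) + C]^l$ is $b_1 \lambda$ and, by the
choice of $s$, we have $b_1 \lambda < \lambda b_0 s$. So the inequality
holds for $i$ sufficiently large.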

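iii) Consider the subset

$$X = \{v_{n_{si} - 1} : i \in \mathbb{N}\} = \{v_{n_0 + siC - 1} : i \in \mathbb{N}\}.$$

Since $\rho_L(n_0 + siC) > 0$, $r_S(v_{n_0 + siC - 1})$ is the first word
of length $n_0 + siC$ and

$$r_S(X) = r_S(\{v_n : n \in \mathbb{N}\}) \cap \Sigma^{n_0} (\Sigma^{sC})^*.$$

So $X$ is an $S$-recognizable subset of $\mathbb{N}$.

Assume that $\lambda X$ is recognizable. Therefore, $|r_S(\lambda X)|$ is
a finite union of arithmetic progressions. In view of ii), we can apply
Lemma ^ and obtain two integral constants $\Gamma$ and $k$ such that for
all $\alpha \in \mathbb{N}$,

$$|r_S(\lambda v_{n_0 + sC(i + \alpha k) - 1})| = |r_S(\lambda v_{n_0 + sCi - 1})| + \alpha \Gamma.$$

Or equivalently, if we set $z = |r_S(\lambda v_{n_0 + sCi - 1})|$ then

$$v_{z + \alpha\Gamma - 1} \le \lambda v_{n_0 + sC(i + \alpha k) - 1} < v_{z + \alpha\Gamma}. \qquad (7)$$

First consider the left inequality in (7); with the same argument as in
i), we obtain

$$v_{z + \alpha\Gamma - 1} \ge J (z + \alpha\Gamma - C)^{l+1}.$$

On the other hand,

$$\lambda v_{n_0 + sC(i + \alpha k) - 1} \le \lambda K (n_0 + sCi + sCk\alpha - 1)^{l+1}.$$

Since $\alpha$ can be arbitrarily large, we focus on the terms of the form
$\alpha^{l+1}$. Then we obtain the following condition,

$$J \Gamma^{l+1} \le \lambda K (sCk)^{l+1} \quad\text{or}\quad \lambda \ge \frac{J}{K} \left( \frac{\Gamma}{sCk} \right)^{l+1}. \qquad (8)$$

If we consider the right inequality in (7), we have
$v_{z + \alpha\Gamma} \le K (z + \alpha\Gamma)^{l+1}$ and also

$$\lambda v_{n_0 + sC(i + \alpha k) - 1} \ge \lambda J (n_0 + sCi + sCk\alpha - C)^{l+1}.$$

If we focus on terms in $\alpha^{l+1}$, we obtain

$$\lambda \le \frac{K}{J} \left( \frac{\Gamma}{sCk} \right)^{l+1}. \qquad (9)$$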
iv) By Theorem 19, $(v_n / n^{l+1})_{n \in \mathbb{N}}$ converges to a
limit $a > 0$. Consider the sequences

$$K_m = a + \frac{1}{m} \quad\text{and}\quad J_m = a - \frac{1}{m}.$$

For a given $m$ there exist $i_m$ and $n_m$ such that for $i \ge i_m$,
$v_{n_i} \ge J_m n_i^{l+1}$ and for $n \ge n_m$, $v_n \le K_m n^{l+1}$. So,
if we replace $K$ by $K_m$ and $J$ by $J_m$, the previous points i), ii)
and iii) remain true for $n$ sufficiently large.

For $m$ large enough, the condition $\lambda > (K_m / J_m)^l$ given in i)
is equivalent to $\lambda \ge 2$ and the conditions (8) and (9) may be
replaced by a unique condition

$$\lambda = \left( \frac{\Gamma}{sCk} \right)^{l+1}$$

which contradicts the hypothesis (remember that $\Gamma$, $s$, $C$ and $k$
are integers).
□
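The role played by the lengths $|r_S(\lambda v_n)|$ in this proof can be observed on a small case. The sketch below again assumes the language $a^*b^*$ (so $l = 1$, and $C = 1$ because $\rho_L$ never vanishes); these choices and the helper names `v` and `rep_length` are illustrative assumptions, not part of the proof.

```python
def v(n: int) -> int:
    """v_n for L = a*b*: number of words of length at most n."""
    return (n + 1) * (n + 2) // 2


def rep_length(x: int) -> int:
    """|r_S(x)|: the unique n such that v_{n-1} <= x < v_n."""
    n = 0
    while x >= v(n):
        n += 1
    return n


def lengths(lam: int, n_max: int) -> list:
    """The lengths |r_S(lam * v_n)| for n = 1 .. n_max."""
    return [rep_length(lam * v(n)) for n in range(1, n_max + 1)]


if __name__ == "__main__":
    # lam = 4 is a square (an (l+1)-th power): the lengths are 2n + 2.
    print(lengths(4, 10))
    # lam = 2 is not a square: consecutive differences are not constant.
    print(lengths(2, 10))
```

With these assumptions the length estimate of part i) specializes to $n + 1 \le |r_S(\lambda v_n)| \le \lambda n$ for $n \ge 2$ and $\lambda \ge 2$, which the sketch lets one check directly.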



6 Multiplication and complement of polynomial 
languages 

In the previous sections, we have considered multiplication for numeration
systems based on a polynomial language. If the complexity function of a
regular language is not bounded by a polynomial then it is of order
$2^{\Theta(n)}$ and the language is said to be exponential. The class of
exponential languages splits into two subclasses according to whether the
complement of a language is polynomial or not.

In this section, we have a closer look at numeration systems constructed 
on an exponential regular language such that its complement has a com- 
plexity function bounded by a polynomial. We show that for such systems, 
multiplication by a constant generally does not preserve recognizability. 

We begin with the example of $\Sigma^* \setminus L$ where $L$ is the
polynomial language $a^*b^*$ and $\Sigma = \{a, b\}$. Thus, with
$S = (\Sigma^* \setminus L, \{a, b\}, a < b)$, we compute the
representations of $2 v_n$ and obtain Table 1 (for an algorithm of
representation, see [^).

In view of this table, it appears that the number of leading $b$'s in the
representation is increasing. Furthermore, it seems that the length of the
tail also increases. Let us show that this observation is true and can be
generalized.

Definition 20 Let $L \subseteq \Sigma^*$ and $x \in \Sigma^*$, we set
$L_x = \{w \in L : w = xy\}$. It is clear that $L_x \subseteq L$. So
$\rho_{L_x}(n) \le \rho_L(n)$ and $\rho_{L_x}$ is $O(n^l)$ whenever
$\rho_L$ is $O(n^l)$.
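The counting functions $\rho_{L_x}$ of Definition 20 can be checked by brute force on small lengths. The following sketch does so for the complement of $a^*b^*$ over $\{a, b\}$ and prefixes of the form $b^{n-k}$; the helper names `in_complement` and `rho_prefix` are hypothetical and the enumeration is exponential, so it is only meant for small $n$.

```python
from itertools import product


def in_complement(word: str) -> bool:
    """True when word lies in {a,b}* minus a*b*, i.e. it contains the factor 'ba'."""
    return "ba" in word


def rho_prefix(prefix: str, n: int) -> int:
    """Number of words of length n in the complement that start with prefix."""
    if len(prefix) > n:
        return 0
    rest = n - len(prefix)
    return sum(
        1
        for tail in product("ab", repeat=rest)
        if in_complement(prefix + "".join(tail))
    )


if __name__ == "__main__":
    n = 6
    for k in range(n):
        # Among the 2^k words b^(n-k) y of length n, only b^n lies in a*b*.
        print(k, rho_prefix("b" * (n - k), n), 2 ** k - 1)
```

For $0 \le k < n$ the two printed counts agree, since $b^n$ is the only word of $a^*b^*$ of length $n$ with prefix $b^{n-k}$.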

In our example, for $0 \le k < n$, we have

$$\rho_{(\Sigma^* \setminus L)_{b^{n-k}}}(n) = \rho_{\Sigma^*_{b^{n-k}}}(n) - \rho_{L_{b^{n-k}}}(n) = 2^k - 1.$$

  n   2 v_n     r_S(2 v_n)               leading b's   tail length
  1   0         ba                       1             0
  2   2         baa                      1             1
  3   10        baab                     1             2
  4   32        babab                    1             3
  5   84        bbaaaa                   2             3
  6   198       bbababa                  2             4
  7   438       bbbaaabb                 3             4
  8   932       bbbabbabb                3             5
  9   1936      bbbbabaaba               4             5
 10   3962      bbbbbaabaaa              5             5
 11   8034      bbbbbabbbbab             5             6
 12   16200     bbbbbbabbaaab            6             6
 13   32556     bbbbbbbabaabaa           7             6
 14   65294     bbbbbbbbaababba          8             6
 15   130798    bbbbbbbbbaaaabbb         9             6
 16   261836    bbbbbbbbbabbbabbb        9             7
 17   523944    bbbbbbbbbbabbaabba       10            7
 18   1048194   bbbbbbbbbbbabababaa      11            7
 19   2096730   bbbbbbbbbbbbabaaaaab     12            7
 20   4193840   bbbbbbbbbbbbbaababbab    13            7
 21   8388100   bbbbbbbbbbbbbbaaabbaaa   14            7

Table 1: first terms of $2 v_n$ for $S = (\{a, b\}^* \setminus a^*b^*, \{a, b\}, a < b)$.
The complexity function $\rho_{(\Sigma^* \setminus L)}(n)$ of the language
associated to the system $S$ is $2^n - n - 1$. So the sequence $v_n$
associated to $\Sigma^* \setminus L$ is

$$v_n = \sum_{i=0}^{n} \rho_{(\Sigma^* \setminus L)}(i) = 2^{n+1} - \frac{n(n+3)}{2} - 2.$$

The words of $r_S(\{v_n : n \in \mathbb{N}\})$ are the first words of each
length in $\Sigma^* \setminus L$. So $\{v_n : n \in \mathbb{N}\}$ is
$S$-recognizable. Recall that $|r_S(x)| = n \Leftrightarrow v_{n-1} \le x < v_n$.
For $n$ large enough, it is obvious that $v_n \le 2 v_n < v_{n+1}$. Then
$|r_S(2 v_n)| = n + 1$.
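The first rows of Table 1 can be recomputed directly from the definitions. The sketch below enumerates the language $\{a,b\}^* \setminus a^*b^*$ in genealogical order by brute force; it is not the representation algorithm cited above, and the names `words_of_length`, `rep` and `row` are hypothetical helpers.

```python
from itertools import product


def words_of_length(n: int) -> list:
    """Words of length n in {a,b}* minus a*b*, in lexicographic order (a < b)."""
    candidates = ("".join(t) for t in product("ab", repeat=n))
    return [w for w in candidates if "ba" in w]


def v(n: int) -> int:
    """v_n = 2^(n+1) - n(n+3)/2 - 2: number of words of length at most n."""
    return 2 ** (n + 1) - n * (n + 3) // 2 - 2


def rep(x: int) -> str:
    """r_S(x): the word of rank x (ranks start at 0) in genealogical order."""
    n = 0
    while x >= v(n):  # words of length <= n carry the ranks 0 .. v(n) - 1
        n += 1
    offset = x - (v(n - 1) if n > 0 else 0)
    return words_of_length(n)[offset]


def row(n: int) -> tuple:
    """(n, 2 v_n, r_S(2 v_n), number of leading b's, length of the tail)."""
    word = rep(2 * v(n))
    leading = len(word) - len(word.lstrip("b"))
    return n, 2 * v(n), word, leading, len(word) - leading - 1


if __name__ == "__main__":
    for n in range(1, 11):
        print(*row(n))
```

The enumeration costs $2^n$ per length, so this is only practical for the first few rows; it also lets one confirm that $|r_S(2 v_n)| = n + 1$ on those rows.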

Let us show that $\{2 v_n : n \in \mathbb{N}\}$ is not $S$-recognizable.
For each $n$ there exists a unique $i$ such that

$$\rho_{(\Sigma^* \setminus L)_{b^{n-i+1}}}(n+1) = 2^{i} - 1 < \underbrace{v_{n+1} - 2 v_n}_{= n(n+1)/2} \le 2^{i+1} - 1 = \rho_{(\Sigma^* \setminus L)_{b^{n-i}}}(n+1).$$

Then $r_S(2 v_n) = b^{n-i} a z$ with $|z| = i$. Notice that, as a function
of $n$, $i$ is increasing but grows more slowly than $n$ (in fact, it has
a logarithmic growth). So $n - i \to +\infty$.

Assume that $\mathcal{L} = r_S(\{2 v_n : n \in \mathbb{N}\})$ is accepted
by an automaton with $q$ states. There exist $n_0$, $i_0$ and $t > 0$ such
that $r_S(2 v_{n_0}) = b^{q+t} a z_0$ with $|z_0| = i_0$. By the pumping
lemma, there exists $\alpha > 0$ such that

$$\forall m \in \mathbb{N}, \quad b^{q + t + m\alpha} a z_0 \in \mathcal{L}.$$

In this last expression, $z_0$ has a constant length $i_0$ independent of
$m$. A contradiction.

In view of this example, we state the following theorem. Recall that the
complexity of any polynomial language is $\Theta(n^l)$ for some $l$.

Theorem 21 Let $\Sigma = \{\sigma_1 < \cdots < \sigma_{s-1} < \beta\}$,
$s \ge 2$ and $L \subseteq \Sigma^*$ be a regular language such that
$\rho_L(n)$ is $\Theta(n^l)$. If $S = (\Sigma^* \setminus L, \Sigma, <)$
then there exists an $S$-recognizable set $X \subseteq \mathbb{N}$ such
that for all $j \ge 1$, $s^j X$ is not $S$-recognizable.

Proof. For $0 \le k \le n$, we have

$$\rho_{(\Sigma^* \setminus L)_{\beta^{n-k}}}(n) = \rho_{\Sigma^*_{\beta^{n-k}}}(n) - \rho_{L_{\beta^{n-k}}}(n) = s^k - \underbrace{\rho_{L_{\beta^{n-k}}}(n)}_{\in O(n^l)}.$$

To avoid any misunderstanding, $v_n$ is the sequence associated to the
language $\Sigma^* \setminus L$ of the numeration system $S$ and $v_L(n)$
is related to $L$. So, $v_L(n) = \sum_{i=0}^{n} \rho_L(i)$ and

$$v_n = \sum_{i=0}^{n} \rho_{(\Sigma^* \setminus L)}(i) = \frac{s^{n+1} - 1}{s - 1} - v_L(n).$$

We take $X = \{v_n : n \in \mathbb{N}\}$, an $S$-recognizable set. We
have, for $n$ sufficiently large,

$$v_{n+j-1} \le s^j v_n < v_{n+j}.$$

Indeed, $v_{n+j} - s^j v_n = s^j v_L(n) - v_L(n+j) + \frac{s^j - 1}{s - 1}$.
By Theorem 19, there exists $a > 0$ such that $v_L(n) \sim a n^{l+1}$. So
$v_{n+j} - s^j v_n \sim (s^j - 1)\, a\, n^{l+1}$. On the other hand,
$s^j v_n - v_{n+j-1} = s^{n+j} + v_L(n+j-1) - s^j v_L(n) - \frac{s^j - 1}{s - 1}$
has an exponential dominant term. Then $|r_S(s^j v_n)| = n + j$.

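For all $n$ sufficiently large, there exists a unique $i$ such that

$$\underbrace{\rho_{(\Sigma^* \setminus L)_{\beta^{n-i+1}}}(n+j)}_{= s^{i+j-1} - \rho_{L_{\beta^{n-i+1}}}(n+j)} < v_{n+j} - s^j v_n \le \underbrace{\rho_{(\Sigma^* \setminus L)_{\beta^{n-i}}}(n+j)}_{= s^{i+j} - \rho_{L_{\beta^{n-i}}}(n+j)}. \qquad (10)$$

Then $r_S(s^j v_n) = \beta^{n-i} \sigma z$ with $|z| = i + j - 1$ and
$\sigma \ne \beta$. Notice that as a function of $n$, $i$ is increasing
and not bounded. It remains to show that $n - i \to +\infty$ if
$n \to +\infty$. Assume that $n - i$ is bounded and divide all members of
(10) by $s^n$; letting $n \to +\infty$, we obtain a contradiction.

Suppose that $r_S(s^j X)$ is accepted by an automaton with $q$ states.
There exist $n_0$, $i_0$ and $t > 0$ such that
$r_S(s^j v_{n_0}) = \beta^{q+t} \sigma z_0$ with $|z_0| = i_0$ and
$\sigma \ne \beta$. Then using the pumping lemma, we obtain a
contradiction. □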

7 Relation with positional numeration systems 

In this section, we give sufficient conditions to achieve the computation
of a $U$-representation of an integer from its $S$-representation, where
$U$ is some positional numeration system related to a sequence of
integers. In particular, we obtain sufficient conditions to guarantee the
stability of the $S$-recognizability after addition and multiplication by
a constant.

Let us recall some definitions. A 2-tape automaton over $A^* \times B^*$
(also called transducer) is a directed graph with edges labelled by
elements of $A^* \times B^*$. The automaton is finite if the set of edges
is finite. A 2-tape automaton is said to be letter-to-letter if the edges
are labelled by elements of $A \times B$. A relation
$R \subseteq A^* \times B^*$ is said to be computable by a finite 2-tape
automaton if there exists a finite 2-tape automaton over $A^* \times B^*$
such that the set of labels of paths starting in an initial state and
ending in a final state is equal to $R$. Finally, a function is computable
by a finite 2-tape automaton if its graph is computable by a finite 2-tape
automaton.

Definition 22 If $U = (U_n)_{n \in \mathbb{N}}$ is a sequence of integers
and $x = x_n \cdots x_0$ is a word over an alphabet
$B \subset \mathbb{Z}$, we define the numerical value of $x$ as

$$\pi_U(x) = \sum_{i=0}^{n} x_i U_i.$$

Notice that different words can have the same numerical value.
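A minimal sketch of Definition 22, assuming for illustration the sequence $U = 1, 2, 3, 5, 8, \ldots$ (a choice made here, not one fixed by the text). Words are given as lists of integer digits, most significant first; the function name `numerical_value` is hypothetical.

```python
def numerical_value(digits: list, U: list) -> int:
    """pi_U(x) for x = x_n ... x_0, given as [x_n, ..., x_0]."""
    n = len(digits) - 1
    return sum(d * U[n - pos] for pos, d in enumerate(digits))


if __name__ == "__main__":
    U = [1, 2, 3, 5, 8, 13]  # illustrative sequence, U_0 = 1
    # Two different words with the same numerical value 3:
    print(numerical_value([1, 0, 0], U), numerical_value([0, 1, 1], U))
    # Digits may be negative since the alphabet B is only a subset of Z:
    print(numerical_value([1, -1], U))
```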

Proposition 23 Let $L \subseteq \Sigma^*$ be a regular language,
$M_L = (K, s, F, \Sigma, \delta)$ be a DFA accepting $L$ and
$S = (L, \Sigma, <)$. Let $U = (U_n)_{n \in \mathbb{N}}$ be a sequence of
integers such that $U_0 = 1$. If there exist
$k, a \in \mathbb{N} \setminus \{0\}$, $e_{p,i} \in \mathbb{Z}$
($p \in K$, $i = 0, \ldots, k-1$) such that for all states $p \in K$ and
all $n \in \mathbb{N}$

$$a\, u_{n+k-1}(p) = \sum_{i=0}^{k-1} e_{p,i}\, U_{n+i}, \qquad (11)$$

then there exist a finite alphabet $B \subset \mathbb{Z}$ and a finite
letter-to-letter automaton which compute a function $g : L \to B^*$ such
that $|w| = |g(w)|$ and

$$a\, \mathrm{val}_S(w) = \pi_U(g(w)).$$

Remark 2 The function $g$ of the previous proposition is injective.
Indeed, if $v$ and $w$ are two words of $L$ such that $g(v) = g(w)$ then
$\mathrm{val}_S(v) = \mathrm{val}_S(w)$, and the conclusion follows since
$\mathrm{val}_S$ is a one-to-one correspondence.

Proof. We consider words of length at least $k$. Indeed, there is only a
finite number of words of length less than $k$ and they can be treated
separately. Let $w = w_{k+l} \cdots w_{k-1} w_{k-2} \cdots w_0$ be a word
of $L$ of length $k + l + 1$ with $l \ge -1$. We compute $l + 2$
applications of Lemma 3 on $\mathrm{val}_S(w)$ and we obtain

$$\sum_{\sigma < w_{k+l}} u_{k+l}(s.\sigma) + \sum_{i=-1}^{l} u_{k+i}(s) + \sum_{i=-1}^{l-1} \sum_{\sigma < w_{k+i}} u_{k+i}(s.w_{k+l} \cdots w_{k+i+1}\sigma)$$
$$+\ \mathrm{val}_{s.w_{k+l} \cdots w_{k-1}}(w_{k-2} \cdots w_0) + v_{k-2}(s) - v_{k-2}(s.w_{k+l} \cdots w_{k-1}).$$






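Recall that the notation $p.\sigma$ is written in place of
$\delta(p, \sigma)$. We will denote by $C_w$ the sum of the last three
terms. For all $q \in K$, $p \in K \setminus \{s\}$ and
$\sigma \in \Sigma$, let us define

$$\beta_{q,p,\sigma} = \#\{\sigma' < \sigma : q.\sigma' = p\}$$

and

$$\beta_{q,s,\sigma} = 1 + \#\{\sigma' < \sigma : q.\sigma' = s\}.$$

With these notations, we can rewrite $\mathrm{val}_S(w)$ as

$$C_w + \sum_{p \in K} \beta_{s,p,w_{k+l}}\, u_{k+l}(p) + \sum_{i=-1}^{l-1} \sum_{p \in K} \beta_{s.w_{k+l} \cdots w_{k+i+1},\,p,\,w_{k+i}}\, u_{k+i}(p).$$

Therefore, using (11), we have

$$a\, \mathrm{val}_S(w) = a\, C_w + \sum_{j=0}^{k-1} \underbrace{\sum_{p \in K} \beta_{s,p,w_{k+l}}\, e_{p,j}}_{=: \lambda_{l,j}}\, U_{l+1+j}
+ \sum_{i=-1}^{l-1} \sum_{j=0}^{k-1} \underbrace{\sum_{p \in K} \beta_{s.w_{k+l} \cdots w_{k+i+1},\,p,\,w_{k+i}}\, e_{p,j}}_{=: \lambda_{i,j}}\, U_{i+1+j}.$$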


It is obvious that the $\lambda_{i,j}$'s take their values in a finite set
$R$. Therefore sums of $k - 1$ elements of $R$ also take their values in a
finite set, say $T$. Notice that the $\lambda_{i,j}$'s (resp. the
$\lambda_{l,j}$'s) are completely determined by the letter $w_{k+i}$
(resp. $w_{k+l}$) and the state $s.w_{k+l} \cdots w_{k+i+1}$ reached after
the reading of the first letters of $w$ (resp. the state $s$). Therefore,
we extend the notation $\lambda_{i,j}$ to a meaningful one:

$$\lambda_{q,\sigma,j} = \sum_{p \in K} \beta_{q,p,\sigma}\, e_{p,j} \qquad (12)$$

with $q \in K$, $\sigma \in \Sigma$ and $j = 0, \ldots, k-1$.

We are now able to build a finite letter-to-letter 2-tape automaton
$\mathcal{M}$ over $\Sigma^* \times B^*$ with $B \subset \mathbb{Z}$ some
finite alphabet. The formula expressing $a\, \mathrm{val}_S(w)$ can be
interpreted in the following way. The reading of $w_{k+i}$,
$-1 \le i \le l$, provides the decomposition of $a\, \mathrm{val}_S(w)$
with $\lambda_{i,k-1} U_{k+i}$; $\lambda_{i,k-2} U_{k+i-1}$; $\ldots$;
$\lambda_{i,0} U_{i+1}$. The reading of $w_{k+i}$ gives a coefficient
$\lambda_{i,k-1}$ for $U_{k+i}$. The other $k - 1$ coefficients can be
viewed as "remainders". Roughly speaking, if we have already read the word
$t = w_{k+l} \cdots w_{k+i+1}$ and if we are reading $\sigma = w_{k+i}$,
then we have to consider the state $s.t$. (Therefore it seems natural to
mimic $M_L$ in $\mathcal{M}$.) The coefficients $\lambda_{i,k-1}$;
$\ldots$; $\lambda_{i,0}$ are nothing else but $\lambda_{s.t,\sigma,k-1}$;
$\ldots$; $\lambda_{s.t,\sigma,0}$.

Thereby we can give a precise definition of $\mathcal{M}$. The set of
states is
$\mathcal{K} = K \cup \{f\} \times \underbrace{T \times \cdots \times T}_{k-1}$
where $f$ does not belong to $K$ and is the unique final state of
$\mathcal{M}$. The copies of $T$ will be used to store the "remainders".
The start state is $(s, 0, \ldots, 0)$. The transition relation
$\Delta : \mathcal{K} \times (\Sigma \times B) \to \mathcal{K}$ is defined
as follows. If $p \in K$, $\sigma \in \Sigma$,

$$\Delta\big((p, \gamma_{k-2}, \ldots, \gamma_0),\ (\sigma, \lambda_{p,\sigma,k-1} + \gamma_{k-2})\big)
= (p.\sigma,\ \lambda_{p,\sigma,k-2} + \gamma_{k-3},\ \ldots,\ \lambda_{p,\sigma,1} + \gamma_0,\ \lambda_{p,\sigma,0}).$$

These transitions compute an output $x_{k+l} \cdots x_{k-1}$ from
$w_{k+l} \cdots w_{k-1}$. The alphabet $B$ is finite since $T$ is finite.

But we still have to read the last $k - 1$ letters of $w$. For each state
$p \in K$, $D_p = L_p \cap \Sigma^{k-1}$ is finite (recall that $L_p$ is
the set of words accepted from $p$). So, for each state $p \in K$ and each
word $w_{k-2} \cdots w_0 \in D_p$, we construct an edge from
$(p, \gamma_{k-2}, \ldots, \gamma_0)$ to $f$ labelled by
$(w_{k-2} \cdots w_0,\ \gamma_{k-2} \cdots \gamma_1 (\gamma_0 + a\, C_w))$.
(This kind of edge can naturally be split into $k - 1$ elementary edges
using $k - 2$ new states.) Indeed, notice that $C_w$ is a constant which
only depends on the state $s.w_{k+l} \cdots w_{k-1}$ reached (the first
component in $\mathcal{K}$) and the remaining word $w_{k-2} \cdots w_0$. □
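The decomposition of $\mathrm{val}_S(w)$ used in this proof (shorter words first, then, letter by letter, the words that branch off through a smaller letter) can be sketched as follows. The DFA is an assumption chosen for illustration: it accepts the words over $\{a, b, c\}$ avoiding the factor $aa$. The names `u`, `v`, `val` and `accepted` are hypothetical helpers, and the sketch computes ranks only; it does not build the transducer $\mathcal{M}$.

```python
from functools import lru_cache
from itertools import product

ALPHABET = "abc"          # ordered a < b < c
START, FINAL = "s", {"s", "t"}
DELTA = {                 # illustrative DFA: words without the factor 'aa'
    ("s", "a"): "t", ("s", "b"): "s", ("s", "c"): "s",
    ("t", "a"): "p", ("t", "b"): "s", ("t", "c"): "s",
    ("p", "a"): "p", ("p", "b"): "p", ("p", "c"): "p",
}


@lru_cache(maxsize=None)
def u(n: int, state: str) -> int:
    """u_n(state): number of words of length n accepted from state."""
    if n == 0:
        return 1 if state in FINAL else 0
    return sum(u(n - 1, DELTA[state, x]) for x in ALPHABET)


def v(n: int, state: str) -> int:
    """v_n(state): number of words of length at most n accepted from state."""
    return sum(u(i, state) for i in range(n + 1))


def accepted(word: str) -> bool:
    state = START
    for letter in word:
        state = DELTA[state, letter]
    return state in FINAL


def val(word: str) -> int:
    """Rank of an accepted word in the genealogically ordered language."""
    total = v(len(word) - 1, START)          # all shorter words come first
    state = START
    for pos, letter in enumerate(word):
        remaining = len(word) - pos - 1
        for smaller in ALPHABET[:ALPHABET.index(letter)]:
            total += u(remaining, DELTA[state, smaller])
        state = DELTA[state, letter]
    return total


if __name__ == "__main__":
    language = [
        "".join(t)
        for n in range(4)
        for t in product(ALPHABET, repeat=n)
        if accepted("".join(t))
    ]
    print([(w, val(w)) for w in language[:8]])
```

Comparing `val(w)` with the position of `w` in a brute-force enumeration, as in the usage lines, is a simple way to check the formula on short words.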



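Remark 3 The sequences associated with the states of $M_L$ satisfy the
same recurrence relation of degree $l$. A practical way to check (11) is
to seek a final state $f \in F$ such that

$$\det \begin{pmatrix} u_0(f) & \cdots & u_{l-1}(f) \\ \vdots & & \vdots \\ u_{l-1}(f) & \cdots & u_{2l-2}(f) \end{pmatrix} \ne 0.$$

If such an $f$ exists then for all $p \in K$, there exist
$c_{p,i} \in \mathbb{Q}$ such that

$$u_{n+l-1}(p) = \sum_{i=0}^{l-1} c_{p,i}\, u_{n+i}(f)$$

and (11) can be easily obtained.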
Recall that a strictly increasing sequence $U = (U_n)_{n \in \mathbb{N}}$
of integers such that $U_0 = 1$ and $U_{n+1}/U_n$ is bounded, defines a
positional numeration system. If $x$ is an integer, the $U$-representation
of $x$ obtained by the greedy algorithm is denoted by $\rho_U(x)$ and
belongs to $A_U^*$ where $A_U = \{0, \ldots, Q\}$ is the canonical
alphabet of the system $U$, $Q < \max U_{n+1}/U_n$. A set
$X \subseteq \mathbb{N}$ is said to be $U$-recognizable if $\rho_U(X)$ is
regular. For any alphabet $C$ of integers, one can define a partial
function called normalization

$$\nu_{U,C} : C^* \to A_U^* : z \mapsto \rho_U(\pi_U(z)).$$
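The greedy representation and the normalization map can be sketched directly from these definitions. The code below evaluates the function $z \mapsto \rho_U(\pi_U(z))$ numerically; it says nothing about whether this map is computable by a finite automaton. The sequence `FIB`, the function names, and the convention that 0 is represented by the empty word are assumptions made for the illustration.

```python
FIB = [1, 2, 3, 5, 8, 13, 21, 34]  # illustrative sequence with U_0 = 1


def numerical_value(digits: list, U: list) -> int:
    """pi_U(z) for z given most significant digit first."""
    n = len(digits) - 1
    return sum(d * U[n - pos] for pos, d in enumerate(digits))


def greedy(x: int, U: list) -> list:
    """rho_U(x): greedy digits, most significant first, no leading zero."""
    if x < 0 or x >= U[-1]:
        raise ValueError("x must satisfy 0 <= x < U[-1] for this finite table")
    digits, started = [], False
    for base in reversed(U[:-1]):
        d, x = divmod(x, base)       # largest multiple of U_i that fits
        if d or started:
            digits.append(d)
            started = True
    return digits


def normalize(z: list, U: list):
    """nu_{U,C}(z) = rho_U(pi_U(z)); None when pi_U(z) is negative (partial map)."""
    value = numerical_value(z, U)
    return None if value < 0 else greedy(value, U)


if __name__ == "__main__":
    print(greedy(12, FIB))            # digits of 12 = 8 + 3 + 1
    print(normalize([0, 1, 1], FIB))  # same value as the word 100
    print(normalize([2, 0], FIB))     # the digit 2 is outside {0, 1}
```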



Corollary 24 Let $S = (L, \Sigma, <)$. With the hypothesis and notations
of Proposition 23, if the sequence $U$ defines a positional numeration
system such that the normalization function $\nu_{U,B}$ is computable by a
finite letter-to-letter 2-tape automaton then $X \subseteq \mathbb{N}$ is
$S$-recognizable if and only if $aX$ is $U$-recognizable.



Proof. Let the regular language
$\mathcal{G} \subseteq (\Sigma \times B)^*$ be the graph of the function
$g$ defined in Proposition 23. We denote by
$p_1 : \Sigma \times B \to \Sigma$ and $p_2 : \Sigma \times B \to B$ the
canonical homomorphisms of projection. Let

$$Y = p_2[p_1^{-1}(r_S(X)) \cap \mathcal{G}].$$

If $X$ is $S$-recognizable then $Y \subseteq B^*$ is regular and
$\pi_U(Y) = aX$. So $aX$ is $U$-recognizable since $\nu_{U,B}(Y)$ is
regular.

Conversely, if $\rho_U(aX)$ is regular then
$\nu_{U,B}^{-1} \circ \rho_U(aX)$ is also regular. For each $y \in aX$,
$\nu_{U,B}^{-1} \circ \rho_U(y)$ can take more than one value but only one
is in $p_2(\mathcal{G})$. So the set

$$p_1\big(p_2^{-1}[\nu_{U,B}^{-1} \circ \rho_U(aX)] \cap \mathcal{G}\big)$$

is regular and equal to $r_S(X)$. □



Corollary 25 Let $S = (L, \Sigma, <)$. With the hypothesis and notations
of Proposition 23, if the sequence $U$ satisfies a linear recurrence
relation

$$U_n = d_1 U_{n-1} + \cdots + d_m U_{n-m}, \quad d_i \in \mathbb{Z},\ d_m \ne 0,\ n \ge m$$

such that its characteristic polynomial is the minimal polynomial of a
Pisot number then $X \subseteq \mathbb{N}$ is $S$-recognizable if and only
if $X$ is $U$-recognizable.

Proof. It is well known that for such a system $U$ the normalization
$\nu_{U,C}$ is computable by a finite letter-to-letter 2-tape automaton
for any alphabet $C$ (see HI). So by the previous corollary, $X$ is
$S$-recognizable if and only if $aX$ is $U$-recognizable. Another
well-known fact related to Pisot numeration systems is that a subset $X$
is $U$-recognizable if and only if it is definable in the structure
$\langle \mathbb{N}, +, V_U \rangle$ (see Q). In particular,
multiplication by a constant $a$ is definable in
$\langle \mathbb{N}, + \rangle$. So $aX$ is definable in the structure if
and only if $X$ is definable. □



Remark 4 Let $S = (L, \Sigma, <)$ and $S' = (L, \Sigma, \prec)$ be two
systems which only differ by the ordering of the alphabet. If the
hypotheses of Proposition 23 and Corollary 24 are satisfied then a set $X$
is $S$-recognizable if and only if it is $S'$-recognizable. In other
words, recognizable sets are independent of the ordering of the alphabet.



Example 2 Consider the language $L \subseteq \{a, b, c\}^*$ of the words
that do not contain $aa$. Its minimal automaton is given in Figure 1. As
usual, the start state is indicated by an unlabeled arrow and the final
states by double circles.

Figure 1: The minimal automaton of L.

The sequences associated to the different states satisfy the relation

$$u_{n+2} = 2 u_{n+1} + 2 u_n, \quad \forall n \in \mathbb{N}$$

with the initial conditions $u_0(s) = 1$, $u_1(s) = 3$, $u_0(t) = 1$,
$u_1(t) = 2$, $u_0(p) = u_1(p) = 0$. The role of the sequence $U$ of
Proposition 23 can be played by $(u_n(s))_{n \in \mathbb{N}}$. For all
$n \in \mathbb{N}$, we have the relations

$$u_{n+1}(s) = 1\, u_{n+1}(s) + 0\, u_n(s) \ \Rightarrow\ e_{s,0} = 0,\ e_{s,1} = 1$$
$$u_{n+1}(t) = 0\, u_{n+1}(s) + 2\, u_n(s) \ \Rightarrow\ e_{t,0} = 2,\ e_{t,1} = 0$$
$$u_{n+1}(p) = 0\, u_{n+1}(s) + 0\, u_n(s) \ \Rightarrow\ e_{p,0} = 0,\ e_{p,1} = 0$$

Notice that the characteristic polynomial of the recurrence satisfied by
$u_n(s)$ is $x^2 - 2x - 2 = (x - 1 + \sqrt{3})(x - 1 - \sqrt{3})$. So
$U = (u_n(s))_{n \in \mathbb{N}}$ is a positional numeration system
associated to the Pisot number $1 + \sqrt{3}$. From $M_L$, we compute the
$3 \times 3$ matrices $B_\sigma = (\beta_{q,r,\sigma})_{q,r = s,t,p}$,
$\sigma \in \Sigma$:










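$$B_a = \begin{pmatrix} 1 & 0 & 0 \\ 1 & 0 & 0 \\ 1 & 0 & 0 \end{pmatrix}, \quad
B_b = \begin{pmatrix} 1 & 1 & 0 \\ 1 & 0 & 1 \\ 1 & 0 & 1 \end{pmatrix}, \quad
B_c = \begin{pmatrix} 2 & 1 & 0 \\ 2 & 0 & 1 \\ 1 & 0 & 2 \end{pmatrix}.$$

If $E = (e_{q,i})_{q = s,t,p;\ i = 0,1}$ then it follows from (12) that
$(B_\sigma E)_{q,i} = \lambda_{q,\sigma,i}$. We have

$$B_a E = \begin{pmatrix} 0 & 1 \\ 0 & 1 \\ 0 & 1 \end{pmatrix}, \quad
B_b E = \begin{pmatrix} 2 & 1 \\ 0 & 1 \\ 0 & 1 \end{pmatrix}, \quad
B_c E = \begin{pmatrix} 2 & 2 \\ 0 & 2 \\ 0 & 1 \end{pmatrix}.$$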

To obtain the complete transducer, with the notations of the proof of
Proposition 23, we have to compute the $C_w$, namely

$$C_{q,\sigma} = \mathrm{val}_q(\sigma) + v_0(s) - v_0(q)$$

for $q$ and $\sigma$ such that $q.\sigma \in F$. Finally we have in
Figure 2 the finite letter-to-letter automaton built from $M_L$ and the
$\lambda_{q,\sigma,i}$'s.

Figure 2: The transducer computing g.

We can do the same construction for the language $L' = a^{+}\{a, b\}^*$.
Its minimal automaton $M_{L'}$ is given in Figure 3. The role of the
sequence $U$ of Proposition 23 can be played by $u_n(t) = 2^n$. So here,
the Pisot number involved is 2 and it is multiplicatively independent of
$1 + \sqrt{3}$. So from [10], the only subsets which are simultaneously
recognizable in $(L, \{a, b, c\}, a < b < c)$ and
$(L', \{a, b\}, a < b)$ are the arithmetic progressions.

Figure 3: The minimal automaton of $L' = a^{+}\{a, b\}^*$.

Remark 5 Let $J = a\{a, b\}^* \cup \{a, b\}^* bb \{a, b\}^*$. Notice that
$J$ is an exponential language with exponential complement. Its minimal
automaton $M_J$ is given in Figure 4. We consider the numeration system
$S = (J, \{a, b\}, a < b)$

Figure 4: The minimal automaton of $J = a\{a, b\}^* \cup \{a, b\}^* bb \{a, b\}^*$.

and we show that

i) we cannot find a linear recurrent sequence associated to a Pisot number
such that the condition (11) of Proposition 23 is satisfied for all
states of $M_J$;

ii) the set $X = \{v_n(s) : n \in \mathbb{N}\}$ is $S$-recognizable but
$2X$ is not.
One can check that for all $n \ge 1$, $u_n(t) = 2^n$ and



So i) holds. To check ii), we use the same technique as in Theorem 21. One 
can verify that 

has an exponential dominant term. Furthermore, for all n large enough 
there exists i such that 
and $n - i \to +\infty$ if $n \to +\infty$. One can conclude as in
Theorem 21 since $r_S(2 v_n(s)) = b^i a z$ with $|z| = n - i - 1$.